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THE OB RECEPTOR AND METHODS OF DIAGNOSING AND TREATING WEIGHT 

5 I. . INTRODUCTION 

The present invention relates to the discovery, 
identification and characterization of nucleotides that 
encode Ob receptor (ObR) , a receptor protein that 
participates in mammalian body weight regulation. The 

10 invention encompasses obR nucleotides , host cell 
expression systems, ObR proteins, fusion proteins, 
polypeptides and peptides, antibodies to the receptor, 
transgenic animals that express an obR transgene, or 
recombinant knock-out animals that do not express the 

15 ObR, antagonists and agonists of the receptor, and other 
compounds that modulate obR gene expression or ObR 
activity that can be used for diagnosis, drug screening, 
clinical trial monitoring, and/or the treatment of body 
weight disorders, including but not limited to obesity, 

20 cachexia and anorexia. 

II. . BACKGROUND OF THE INVENTION 
Obesity represents the most prevalent of body 
weight disorders, and it is the most important 
nutritional disorder in the western world, with estimates 

25 of its prevalence ranging from 3 0% to 50% within the 

middle-aged population. Other body weight disorders, such 
as anorexia nervosa and bulimia nervosa which together 
affect approximately 0.2% of the female population of the 
western world, also pose serious health threats. 

3 0 Further, such disorders as anorexia and cachexia 

(wasting) are also prominent features of other diseases 
such as cancer, cystic fibrosis, and AIDS. 

Obesity, defined as an excess of body fat relative 
to lean body mass, also contributes to other diseases. 

3 5 For example, this disorder is responsible for increased 



WO 97/19952 PCT/US96/19128 

- 2 - 

incidences of diseases such as coronary artery disease, 
stroke, and diabetes. (See, e.g. . Nishina, P.M. et al. , 
1994, Metab. 43 i 554-558 . ) Obesity is not merely a 
behavioral problem, i.e. , the result of voluntary 
5 hyperphagia. Rather, the differential body composition 
observed between obese and normal subjects results from 
differences in both metabolism and neurologic/metabolic 
interactions. These differences seem to be, to some 
extent, due to differences in gene expression, and/or 

10 level of gene products or activity (Friedman, J.M- et 
al., 1991, Mammalian Gene 1:130-144). 

The epidemiology of obesity strongly shows that 
the disorder exhibits inherited characteristics 
(Stunkard, 1990, N . Eng. J. Med. 322 : 1483) . Moll et al. 

15 have reported that, in many populations, obesity seems to 
be controlled by a few genetic loci (Moll et al. 1991, 
Am. J. Hum. Gen. 4j>:1243). In addition, human twin 
studies strongly suggest a substantial genetic basis in 
the control of body weight, with estimates of 

20 heritability of 80-90% (Simopoulos, A. P. & Childs B., 
eds. , 1989, in "Genetic Variation and Nutrition in 
Obesity", World Review of Nutrition and Diabetes 63., S. 
Karger, Basel, Switzerland; Borjeson, M. , 1976, Acta. 
Paediatr. Scand . 65:279-287). 

25 Studies of non-obese persons who deliberately 

attempted to gain weight by systematically over-eating 
were found to be more resistant to such weight gain and 
able to maintain an elevated weight only by very high 
caloric intake. In contrast, spontaneously obese 

3 0 individuals are able to maintain their status with normal 
or only moderately elevated caloric intake. In addition, 
it is a commonplace experience in animal husbandry that 
different strains of swine, cattle, etc., have different 
predispositions to obesity. Studies of the genetics of 

3 5 human obesity and of models of animal obesity demonstrate 
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that obesity results from complex defective regulation of 
both food intake, food induced energy expenditure and of 
the balance between lipid and lean body anabolism. 

There are a number of genetic diseases in man and 
5 other species which feature obesity among their more 
prominent symptoms, along with, frequently, dysmorphic 
features and mental retardation. For example, Prader- 
Willi syndrome (PWS) affects approximately 1 in 20,000 
live births, and involves poor neonatal muscle tone, 

10 facial and genital deformities, and generally obesity. 

In addition to PWS, many other pleiotropic 
syndromes which include obesity as a symptom have been 
characterized. These syndromes are more genetically 
straightforward, and appear to involve autosomal 

15 recessive alleles. The diseases, which include, among 
others, Ahlstroem, Carpenter, Bardet-Biedl, Cohen, and 
Morgagni-Stewart-Monel Syndromes . 

A number of models exist for the study of obesity 
(see, e.g. . Bray, G . A. , 1992, Prog. Brain Res. 93 : 333- 

20 341, and Bray, G.A. , 1989, Amer. J. Clin. Nutr. 5:891- 
902). For example, animals having mutations which lead 
to syndromes that include obesity symptoms have been 
identified, and attempts have been made to utilize such 
animals as models for the study of obesity. The best 

2 5 studied animal models, to date, for genetic obesity are 

mice models- For reviews, see e.g. , Friedman, J.M. et 
al., 1991, Mamm. Gen. .1:130-144; Friedman, J.M. and 
Liebel, R.L. , 1992, Cell 69:217-220.) 

Studies utilizing mice have confirmed that obesity 

3 0 is a very complex trait with a high degree of 

heritability . Mutations at a number of loci have been 
identified which lead to obese phen o types . These include 
the autosomal recessive mutations obese (ob) , diabetes 
(db) , fat (fat) and tubby ( tub ) . In addition, the 
3 5 autosomal dominant mutations Yellow at the agouti locus 
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and Adipose (Ad) have been shown to contribute to an 
obese phenotype. 

The ob and db mutations are on chromosomes 6 and 
4, respectively, but lead to a complex, clinically 
5 similar phenotype of obesity, evident starting at about 
one month of age, which includes hyperphagia, severe 
abnormalities in glucose and insulin metabolism, very 
poor thermoregulation and non-shivering thermogenesis, 
and extreme torpor and underdevelopment of the lean body 

10 mass. This complex phenotype has made it difficult to 

identify the primary defect attributable to the mutations 
(Bray G . A . , et al., 1989 Amer. J. Clin. Nutr. 5:891-902). 

Using molecular and classical genetic markers, the 
db gene has been mapped to midchromosome 4 (Friedman et 

15 al. , 1991, Mamm. Gen. .1:130-144). The mutation maps to a 
region of the mouse genome that is syntonic with human, 
suggesting that, if there is a human homolog of db, it is 
likely to map to human chromosome lp. 

The ob gene and its human homologue have recently 

20 been cloned (Zhang, Y . et al. , 1994, Nature 372:425-432). 
The gene appears to produce a 4.5 kb adipose tissue 
messenger RNA which contains a 167 amino acid open 
reading frame. The predicted amino acid sequence of the 
ob gene product indicates that it is a secreted protein 

25 and may, therefore, play a role as part of a signalling 
pathway from adipose tissue which may serve to regulate 
some aspect of body fat deposition. Further, recent 
studies have shown that recombinant Ob protein, also 
known as leptin, when exogenously administered, can at 

3 0 least partially correct the obesity-related phenotype 
exhibited by ob mice (Pelleymounter , M.A. et al., 1995, 
Science 269 : 540-543 ; Halalas, J.L. et al. , 1995, Science 
269 :543-546; Campfield, L.A. et al., 1995, Science 
269:546-549). Recent studies have suggested that obese 
35 humans and rodents (other than ob/ob mice) are not 
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defective in their ability to produce ob mRNA or protein, 
and generally produce higher levels than lean individuals 
(Maffei et al. f 1995, Nature Med. 1 ( 11) : 1155-1161 ; 
Considine et al-, 1995, J. Clin. Invest. 95 ( 6) : 2986-2988 ; 
5 Lohnqvist et al., 1995, Nature Med, 1:950-953; Hamilton 
et al., 1995, Nature Med. 1:953-956). These data suggest 
that resistance to normal or elevated levels of Ob may be 
more important than inadequate Ob production in human 
obesity. However, the receptor for the ob gene product, 
10 thought to be expressed in the hypothalamus, remains 
elusive. 

Homozygous mutations at either the fat or tub loci 
cause obesity which develops more slowly than that 
observed in ob and db mice (Coleman, D.L., and Eicher, 
15 E.M., 1990, J. Heredity 81:424-427), with tub obesity 
developing slower than that observed in fat animals. 
This feature of the tub obese phenotype makes the 
development of tub obese phenotype closest in resemblance 
to the manner in which obesity develops in humans. Even 

2 0 so, however, the obese phenotype within such animals can 

be characterized as massive in that animals eventually 
attain body weights which are nearly two times the 
average weight seen in normal mice. 

The fat mutation has been mapped to mouse 

25 chromosome 8, while the tub mutation has been mapped to 
mouse chromosome 7. According to Naggert et al., the fat 
mutation has recently been identified (Naggert, J.K. , et 
al., 1995, Nature Genetics .10:135-141). Specifically, 
the fat mutation appears to be a mutation within the Cpe 

30 locus, which encodes the carboxypeptidase (Cpe) E 
protein. Cpe is an exopeptidase involved in the 
processing of prohormones, including proinsulin. 

The dominant Yellow mutation at the agouti locus, 
causes a pleiotropic syndrome which causes moderate adult 

3 5 onset obesity, a yellow coat color, and a high incidence 
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of tumor formation (Her berg, L . and Coleman, D.L. , 1977, 
Metabolism 26:59), and an abnormal anatomic distribution 
of body fat (Coleman, D.L. , 1978, Diabetologia 14 : 141- 
148) . This mutation may represent the only known example 
5 of a pleiotropic mutation that causes an increase, rather 
than a decrease, in body size. The mutation causes the 
widespread expression of a protein which is normally seen 
only in neonatal skin (Michaud, E.J, et al., 1994, Genes 
Devel. 8:1463-1472). 

10 Other animal models include fa/fa (fatty) rats, 

which bear many similarities to the ob/ob and db/db mice, 
discussed above. One difference is that, while f a/ga, 
rats are very sensitive to cold, their capacity for non- 
shivering thermogenesis is normal. Torpor seems to play 

15 a larger part in the maintenance of obesity in fa/fa rats 
than in the mice mutants. In addition, inbred mouse 
strains such as NZO mice and Japanese KK mice are 
moderately obese. Certain hybrid mice, such as the 
Wellesley mouse, become spontaneously fat. Further, 

2 0 several desert rodents, such as the spiny mouse, do not 

become obese in their natural habitats, but do become so 
when fed on standard laboratory feed. 

Animals which have been used as models for obesity 
have also been developed via physical or pharmacological 
25 methods. For example, bilateral lesions in the 
ventromedial hypothalamus (VMH) and ventrolateral 
hypothalamus (VLH) in the rat are associated, 
respectively, with hyperphagia and gross obesity and with 
aphagia, cachexia and anorexia. Further, it has been 

3 0 demonstrated that feeding monosodium-glutamate (MSG) or 

gold thioglucose to newborn mice also results in an 
obesity syndrome. 

Each of the rodent obesity models is accompanied 
by alterations in carbohydrate metabolism resembling 
3 5 those in Type II diabetes in man. For example, from both 
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ob and db, congenic C57BL/KS mice develop a severe 
diabetes with ultimate 0 cell necrosis and islet atrophy, 
resulting in a relative insulinopenia , while congenic 
C57BL/6J ob and db mice develop a transient insulin- 
5 resistant diabetes that is eventually compensated by 0 
cell hypertrophy resembling human Type II diabetes. 

With respect to ob and db mice, the phenotype of 
these mice resembles human obesity in ways other than the 
development of diabetes, in that the mutant mice eat more 

10 and expend less energy than do lean controls (as do obese 
humans) . This phenotype is also quite similar to that 
seen in animals with lesions of the ventromedial 
hypothalamus, which suggests that both mutations may 
interfere with the ability to properly integrate or 

15 respond to nutritional information within the central 
nervous system. Support for this hypothesis comes from 
the results of parabiosis experiments (Coleman, D.L. 
1973, Diabetologica 9:294-298) that suggest ob mice are 
deficient in a circulating satiety factor and that db 

20 mice are resistant to the effects of the ob factor. 

These experiments have led to the conclusion that obesity 
in these mutant mice may result from different defects in 
an afferent loop and/or integrative center of the 
postulated feedback mechanism that controls body 

2 5 composition. 

In summary, therefore, obesity, which poses a 
major, worldwide health problem, represents a complex, 
highly heritable trait. Given the severity, prevalence 
and potential heterogeneity of such disorders, there 
30 exists a great need for the identification of those genes 
and gene products that participate in the control of body 
weight. 

It is an objective of the invention to provide 
modulators of body weight, to provide methods for 

3 5 diagnosis of body weight disorders, to provide therapy 
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for such disorders and to provide an assay system for the 
screening of substances which can be used to control body 
weight . 

III.* SUMMARY OF THE INVENTION 

5 The present invention relates to the discovery, 

identification and characterization of nucleotides that 
encode Ob receptor (ObR) , a novel receptor protein that 
participates in the control of mammalian body weight. 
ObR, described for the first time herein, is a 

10 transmembrane protein that spans the cellular membrane 
once and is involved in signal transduction triggered by 
the binding of its natural ligand, Ob, also known as 
leptin. ObR has amino acid sequence motifs found in the 
Class I cytokine receptor family, and is most related to 

15 the gpl3 0 signal transducing component of the IL-6 

receptor, the G-CSF receptor, and the LIF receptor- The 
results presented in the working examples herein 
demonstrate that a long-form ObR (predominantly expressed 
in the hypothalamus) transduces signal via a STAT 

2 0 mediated pathway typical of IL-6 type cytokine receptors, 

whereas a major naturally occurring truncated form or a 
mutant form found in obese db/db mice does not. The long 
form ObR can mediate activation of STAT proteins and 
stimulate transcription through IL-6 responsive gene 
25 elements. Reconstitution experiments indicate that, 
although ObR mediates intracellular signals with a 
specificity similar to IL-6 type cytokine receptors, 
signaling appears to be independent of the gpl30 signal 
transducing component of the IL-6 type cytokine 

3 0 receptors. 

The ObR mRNA transcript, which is about 5 kb long, 
is expressed in the choroid plexus, the hypothalamus and 
other tissues, including lung and liver. The murine 
short forms described herein encode receptor proteins of 
35 894 (FIG. 1) and 893 amino acids; murine long form obR 
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cDNAs and human obR cDNAs , described herein, encode 
receptor proteins of 1162 amino acids and 1165 amino 
acids, respectively (FIG. 6 and FIG. 3, respectively). 
The ObR has a typical hydrophobic leader sequence (about 
5 22 amino acids long in both forms of murine ObR, and 

about 2 0 amino acids long in human ObR) ; an extracellular 
domain (about 815 amino acids long in both forms of 
murine ObR, and about 819 amino acids long in human ObR) ; 
a short transmembrane region (about 2 3 amino acids long 

10 in both forms of murine ObR and human ObR) ; and a 

cytoplasmic domain. The transcripts encoding the murine 
ObR short (FIG. 1) and long form (FIG. 6) are identical 
until the fifth codon 5' of the stop codon of the short 
form and then diverge completely, suggestive of 

15 alternative splicing. As described herein, the 

cytoplasmic domain encoded by the 894 amino acid murine 
short form obR cDNA is 3 4 amino acids, while that encoded 
by the murine long form obR cDNA (3 02 amino acids) is 
approximately the same length as the cytoplasmic domain 

2 0 encoded by the human obR cDNA (3 03 amino acids) . The 

deduced amino acid sequences from murine long form ObR 
and human ObR are homologous throughout the length of the 
coding region and share 75% identity (FIG. 7) . 

The obese phenotype of the db mouse results from a 
25 G~>T transversion in the obR gene. This transversion 
creates a splice donor site which in turn leads to 
aberrant processing of obR long form mRNA in db mutants. 
In db mutants this aberrant processing generates long 
form mRNAs which encode a truncated ObR protein that is 

3 0 identical to the 894 amino acid short form ObR. Like the 

short form ObR, the mutant long form ObR lacks most of 
the cytoplasmic domain and is incapable of transducing a 
signal via a STAT mediated pathway. The signalling 
competent long form ObR, which is absent in the db/db 
35 mice, is required for body weight maintenance. 
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The invention encompasses the following 
nucleotides, host cells expressing such nucleotides, and 
the expression products of such nucleotides: (a) 
nucleotides that encode mammalian ObRs, including the 
5 human ObR, and the obR gene product; (b) nucleotides that 
encode portions of the ObR that correspond to its 
functional domains, and the polypeptide products 
specified by such nucleotide sequences, including but not 
limited to the extracellular domain (ECD) , the 
10 transmembrane domain (TM) , and the cytoplasmic domain 
(CD) ; (c) nucleotides that encode mutants of the ObR in 
which all or a part of one of the domains is deleted or 
altered, and the polypeptide products specified by such 
nucleotide sequences, including but not limited to 
15 soluble receptors in which all or a portion of the TM is 
deleted, and nonfunctional receptors in which all or a 
portion of the CD is deleted; (d) nucleotides that encode 
fusion proteins containing the ObR or one of its domains 
f e.g. . the extracellular domain) fused to another 
20 polypeptide* 

The invention also encompasses agonists and 
antagonists of ObR, including small molecules, large 
molecules, mutant Ob proteins that compete with native 
Ob, and antibodies, as well as nucleotide sequences that 
2 5 can be used to inhibit obR gene expression (e.g. , 

antisense and ribozyme molecules, and gene or regulatory 
sequence replacement constructs) or to enhance obR gene 
expression ( e.g. . expression constructs that place the 
obR gene under the control of a strong promoter system) , 
30 and transgenic animals that express an obR transgene or 
"knock-outs" that do not express ObR. 

In addition, the present invention encompasses 
methods and compositions for the diagnostic evaluation, 
typing and prognosis of body weight disorders, including 
35 obesity and cachexia, and for the identification of 
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subjects having a predisposition to such conditions. For 
example, obR nucleic acid molecules of the invention can 
be used as diagnostic hybridization probes or as primers 
for diagnostic PCR analysis for the identification of obR 
5 gene mutations, allelic variations and regulatory defects 
in the obR gene. The present invention further provides 
for diagnostic kits for the practice of such methods. 

Further, the present invention also relates to 
methods for the use of the obR gene and/ or obR gene 

10 products for the identification of compounds which 

modulate, i.e. , act as agonists or antagonists, of obR 
gene expression and or obR gene product activity. Such 
compounds can be used as agents to control body weight 
and, in particular, as therapeutic agents for the 

15 treatment of body weight and body weight disorders, 
including obesity, cachexia and anorexia. 

Still further, the invention encompasses methods 
and compositions for the treatment of body weight 
disorders, including obesity, cachexia, and anorexia. 

20 Such methods and compositions are capable of modulating 
the level of obR gene expression and/or the level of obR 
gene product activity. 

This invention is based, in part, on the 
surprising discovery, after an extensive survey of 

25 numerous cell lines and tissues, of a high affinity 

receptor for Ob in the choroid plexus of the brain, the 
identification and cloning of obR cDNA from a library 
prepared from choroid plexus mRNA, characterization of 
its novel sequence, mapping the obR gene to the same 

3 0 genetic interval in the mouse genome as the db gene maps, 
and characterization of the ObR as a transmembrane 
receptor of the Class I cytokine receptor family. obR 
mRNA was detected in other tissues, including the 
hypothalamus . 
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The full-length ObR, expressed predominantly in 
the hypothalamus signals transduces through activation of 
STAT proteins and stimulation of transcription through 
IL-6 responsive gene elements. The ability of the full- 
5 length long form ObR to signal is in contrast to the 
naturally occurring truncated form or the mutant form 
found in db/db mice which are unable to mediate signal 
transduction. The invention also includes forms of ObR 
lacking one or another of the intracellular domains 
10 important for signalling and induction of gene 
expression. 

A. . DEFINITIONS 

As used herein, the following terms, whether used 
in the singular or plural, will have the meanings 
15 indicated: 

Ob : means the Ob protein described in Zhang, 
Y. et al., 1994, Nature 372.: 42 5-432 , which is 
incorporated herein by reference in its entirety, 
which is also known as leptin. Ob includes 

20 molecules that are homologous to Ob or which bind 

to ObR. Ob fusion proteins having an N-terminal 
alkaline phosphatase domain are referred to herein 
as AP-Ob fusion proteins, while Ob fusion proteins 
having a C-terminal alkaline phosphatase domain 

25 are referred to herein as Ob-AP fusion proteins. 

obR nucleotides or coding sequences: means 
nucleotide sequences encoding ObR protein, 
polypeptide or peptide fragments of ObR protein, 
or ObR fusion proteins. obR nucleotide sequences 

3 0 encompass DNA, including genomic DNA ( e.g . the obR 

gene) or cDNA, or RNA. 

ObR: means Ob receptor protein. 
Polypeptides or peptide fragments of ObR protein 
are referred to as ObR polypeptides or ObR 

3 5 peptides. Fusions of ObR, or ObR polypeptides or 
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peptide fragments to an unrelated protein are 
referred to herein as ObR fusion proteins. 
A functional ObR refers to a protein which binds 
Ob with high affinity in vivo or in vitro. 
5 ECD: means "extracellular domain". 

TM: means "transmembrane domain", 
CD: means "cytoplasmic domain". 

IV. . DESCRIPTION OF THE FIGURES 

FIG. 1. Nucleotide sequence (SEQ. ID. No:l) and 
10 deduced amino acid sequence (SEQ. ID. No: 2) of murine obR 
(short form) cDNA encoding murine short form ObR protein 
(894 amino acids) . The domains of short form murine ObR 
are: signal sequence (amino acid residues 1 to about 
22) , extracellular domain (from about amino acid residue 
15 2 3 to about 8 37) , transmembrane domain (from about amino 
acid residue 838 to about 860) , and cytoplasmic domain 
(from about amino acid residue 861 to 894) . Potential N- 
linked glycosylation sites in the extracellular domain 
are indicated by asterisks above the first amino acid of 
2 0 the N-X-S and N-X-T motifs. Underscores indicate motifs 
conserved in the class I cytokine receptor family. 

FIGS. 2A-2B. AP-Ob fusion protein binding studies. 
FIG. 2A. COS-7 cells transfected with the ObR 
cDNA were treated with various AP or AP-Ob fusion 

2 5 proteins at InM (diluted in DMEM + 10% FBS) . Columns 

show the average of two binding determinations and error 
bars show the difference between the two. 1) Unfused AP, 
2) AP-Ob (mouse), 3) AP-Ob (mouse) + 100 nM mouse Ob, 4) 
AP-Ob (mouse) + lOOnM human Ob, 5) AP-Ob (human), 6) Ob- 

3 0 AP (mouse) , 7) AP-Ob (mouse) incubated with mock 

transfected (vector- no insert) COS-7 cells. 

FIG. 2B. Binding isotherm and Scatchard analysis 
of the interaction of AP-Ob and ObR. COS-7 cells 
transfected with the obR cDNA were incubated with various 
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concentrations of the AP-Ob (mouse) fusion protein. 
Scatchard transformation is shown as an inset. 

FIG. 3. Nucleotide sequence (SEQ - ID. No.: 3) and 
deduced amino acid sequence (SEQ. ID. No.: 4) of human obR 
5 cDNA encoding human ObR protein. The domains of human 
ObR are: signal sequence (from amino acid residue l to 
about 20) , extracellular domain (from about amino acid 
residue 21 to about 839) , transmembrane domain (from 
about amino acid residue 84 0 to about 862) , and 

10 cytoplasmic domain (from about amino acid residue 863 to 
1165). Also depicted are 5' untranslated nucleotide 
sequences. Potential N-linked glycosylation sites in the 
extracellular domain are indicated by asterisks above the 
first amino acid of the N-X-S and N-X-T motifs. 

15 Underscores indicate motifs conserved in the class I 
cytokine receptor f ami ly . 

FIG. 4. Alignment of the extracellular domains of 
the murine ObR and human gpl3 0. Identical residues 
(black) and conservative changes (gray) are indicated by 

2 0 shading around the corresponding amino acids. 

Conservative changes indicated are as defined by FASTA . 

FIG. 5. Alignment of mouse ObR (short form shown 
in FIG. 1) and human ObR. Amino acids that are identical 
between the two sequences are indicated by a star. 

25 FIG. 6. Nucleotide sequence and deduced amino 

acid sequence of murine long form obR cDNA encoding 
murine long form ObR protein. The domains of long form 
murine ObR are: signal sequence (amino acid residues 1 
to about 22) , extracellular domain (from about amino acid 

30 residue 2 3 to about 837) , transmembrane domain (from 
about amino acid residue 838 to about 860) , and 
cytoplasmic domain (from about amino acid residue 8 61 to 
1162) . 

FIG. 7. Alignment of the long forms of human and 

3 5 murine ObR. Identical residues and conservative changes 
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are indicated by two asterisks or one asterisk, 
respectively. Conservative changes indicated are as 
defined by FASTA. Abbreviations: mobr-1, murine ObR 
long form; and hobr, human homolog. 
5 FIG. 8, Location of the gene encoding ObR on 

mouse chromosome 4 . 

FIG. 9. Nucleotide sequence of the 106 base pair 
insert in the long form transcript of db/db. The precise 
position of the insertion in the deduced amino acid 
10 sequence near the insertion region are shown. 

FIG. 10. Bar graph depicting ObR-Ig 
neutralization of OB protein. COS cell were transiently 
transfected with the ObR cDNA and tested for their 
ability to bind 0.5 nM AP-OB. Column 1 shows the high 
15 levels of specific binding observed in the absence of 

ObR-IgG fusion protein. Columns 2, 3 and 4 show the near 
complete inhibition of binding observed with three 
different column fractions of purified ObR IgG. 

FIG. llA. Schematic drawings of various C- 
20 terminal deletion mutants of ObR protein. The names and 
predicted length (aa) of the proteins are shown above 
each protein. The extracellular domains are shown as 
striped, the transmembrane domains are shown as black, 
and the cytoplasmic domains are shown as white. The 
25 location of tyrosine residues in the cytoplasmic domain 
are indicated by horizontal bars (Y 986 , Y1079, and Y1141 
are conserved between human and murine ObR) . The length 
of the cytoplasmic domains (aa) are shown below each 
protein. 

30 FXG * 13 -B. A bar graph depicting the results of 

CAT assays employing a IL-6RE-CAT expression construct 
(upper panel) or a HRRE-CAT expression construct (lower 
panel) and the ObR deletion mutants of FIG. HA. H-35 
cells were transfected with cDNAs encoding the ObR mutant 

35 and either I1-6RE-CAT or HRRE-CAT. Subcultures of cells 
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were treated for 24 hours with serum-free medium alone (- 
) or serum-free medium containing mouse leptin (+) . CAT 
activity was determined and is expressed relative to 
values obtained for untreated control cultures. 
5 FIG- 12. A bar graph depicting the results of an 

AP-Ob fusion protein binding assay. COS-7 cells were 
transfected with a cDNA encoding the indicated ObR 
protein. Forty-eight hours later cells were incubated 
with 1 mM AP-Ob fusion protein. Bars show the average of 

10 two binding assays. The error bars indicate the 
difference between the two assays. 

FIG. 13 A. Schematic drawings of various mutant 
ObR proteins. The location of tyrosine residues 98 6 and 
1079 are indicated. The location of the * box 1" sequence 

15 is also indicated. 

FIG. 13B. Bar graph depicting the results of a 
HRRE-CAT induction assay. H-35 cells were co-transf ected 
with HRRE-CAT and expression constructs for either 0B- 
RY986F, OB-RY1079F or OB-R(box lmt) . Subcultures of 

20 cells were treated for 24 h with serum-free medium 

containing human leptin. CAT activity was determined and 
is expressed relative to values obtained for untreated 
control cultures . 

FIG. 14 A. Schematic drawings of various receptor 

25 chimeras. The portions derived from G-CSFR are shaded; 

the portions derived from ObR are not. The locations of ' 
the predicted Box l, Box 2, and Box 3 motifs are 
indicated. 

FIG. 14 B. Bar graphs depicting the results of IL- 
30 6RE-CAT (left panel) and HRRE-CAT induction assays. H-35 
cells were co-transf ected with expression plasmids for 
the indicated receptor (ObR, G-CSFR, or chimeric) and IL- 
6-RE-CAT or HREE-CAT expression construct. Cells were 
stimulated with the appropriate ligand and CAT activity 
3 5 was determined as in the experiments described in FIG. 
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11B. All values are expressed relative to untreated 
control cultures (mean ± std deviation of 3 to 4 
experiments) . 

FIG. 15A. Bar graph depicting the results of 
5 HRRE-CAT induction assays. H-35 cells were co- 

transf ected with HRRE-CAT and the indicated amount of ObR 
and OB-RA868-1165. Cells were stimulated with leptin, 
and CAT activity was determined as in the experiments 
described in FIG. 11. All values are expressed relative 

10 to the untreated cultures. 

FIG. 15B. Bar graph depicting the results of IL- 
6RE-CAT induction assays. H-35 cells were co-transf ected 
with IL-6RE-CAT and the indicated amount of ObR/G-CSFR 
and OB-RA868-1165. Cells were stimulated with leptin, 

15 and CAT activity was determined as in the experiments 

described in FIG. 11. All values are expressed relative 
to the untreated cultures. 

FIG. 15C. Bar graph depicting the results of IL- 
6 RE— CAT induction assays. H-3 5 cells were co-transf ected 

2 0 with IL-6RE-CAT and the indicated amount of G-CSFR and G- 
CSFR(Acyto) . Cells were stimulated with G-CSF, and CAT 
activity was determined as in the experiments described 
in FIG. 11. All values are expressed relative to the 
untreated cultures . 

25 FIG. 15D. Bar graph depicting the results of IL- 

6RE— CAT induction assays. H-3 5 cells were co-transf ected 
with IL-6RE-CAT and the indicated amount of G-CSFR/ObR 
and G-CSFR( Acyto) . Cells were stimulated with G-CSF, and 
CAT activity was determined as in the experiments 

30 described in FIG. 11. All values are expressed relative 
to the untreated cultures . 

FIG. 15E. Bar graph depicting the results of IL- 
6RE— CAT induction assays. H-3 5 cells were co-transf ected 
with IL-6RE-CAT and the indicated amount of ObR and OB- 

35 RY1141F. Cells were stimulated with leptin, and CAT 
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activity was determined as in the experiments described 
in FIG. 11. All values are expressed relative to the 
untreated cultures. 

V. . DETAILED DESCRIPTION OF THE INVENTION 

5 ObR, described for the first time herein, is a 

novel receptor protein that participates in body weight 
regulation. ObR is a transmembrane protein that spans 
the membrane once and belongs to the class I family of 
cytokine receptors, and is most closely related to the 

10 gpl3 0 signal transducing component of the IL-6 receptor, 
the G-CSF receptor, and the LIF receptor. Signal 
transduction is triggered by the binding of Ob to the 
receptor. Neutralization of Ob, removal of Ob, or 
interference with its binding to ObR results in weight 

15 gain. ObR mRNA is detected in the choroid plexus, and 
other tissues, including the hypothalamus. 

The invention encompasses the use of obR 
nucleotides, ObR proteins and peptides, as well as 
antibodies to the ObR (which can, for example, act as ObR 

2 0 agonists or antagonists) , antagonists that inhibit 

receptor activity or expression, or agonists that 
activate receptor activity or increase its expression in 
the diagnosis and treatment of body weight disorders, 
including, but not limited to obesity, cachexia and 
25 anorexia in animals, including humans. The diagnosis of" 
an ObR abnormality in a patient, or an abnormality in the 
ObR signal transduction pathway, will assist in devising 
a proper treatment or therapeutic regimen. In addition, 
obR nucleotides and ObR proteins are useful for the 

3 0 identification of compounds effective in the treatment of 

body weight disorders regulated by the ObR. 

In particular, the invention described in the 
subsections below encompasses ObR, polypeptides or 
peptides corresponding to functional domains of the ObR 
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( e.g. . ECD, TM or CD) , mutated, truncated or deleted ObRs 
( e.g. an ObR with one or more functional domains or 
portions thereof deleted, such as ATM and/or A CD) , ObR 
fusion proteins ( e.g. an ObR or a functional domain of 
5 ObR, such as the ECD, fused to an unrelated protein or 
peptide such as an immunoglobulin constant region, i.e. , 
igFc) , nucleotide sequences encoding such products, and 
host cell expression systems that can produce such ^pbR 
products . 

10 The invention also features Ob receptors having an 

amino acid sequence that is substantially identical to a 
defined amino acid sequence. 

By "substantially identical" is meant a 
polypeptide or nucleic acid having a sequence that is at 

15 least 85%, preferably 90%, and more preferably 95% or 
more identical to the sequence of the reference amino 
acid or nucleic acid sequence. For polypeptides, the 
length of the reference polypeptide sequence will 
generally be at least 16 amino acids, preferably at least 

20 20 amino acids, more preferably at least 25 amino acids, 
and most preferably 3 5 amino acids. For nucleic acids, 
the length of the reference nucleic acid sequence will 
generally be at least 50 nucleotides, preferably at least 
60 nucleotides, more preferably at least 75 nucleotides, 

25 and most preferably 110 nucleotides. 

Sequence identity can be measured using sequence 
analysis software (e.g., Sequence Analysis Software 
Package of the Genetics Computer Group, University of 
Wisconsin Biotechnology Center, 1710 University Avenue, 

30 Madison, WI 53705) . 

In the case of polypeptide sequences which are 
less than 100% identical to a reference sequence, the 
non- identical positions are preferably, but not 
necessarily, conservative substitutions for the reference 

35 sequence. Conservative substitutions typically include 
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substitutions within the following groups: glycine and 
alanine; valine, isoleucine, and leucine; aspartic acid 
and glutamic acid; asparagine and glutamine; serine and 
threonine; lysine and arginine; and phenylalanine and 
5 tyrosine. 

Where a particular polypeptide is said to have a 
specific percent identity to a reference polypeptide of a 
defined length, the percent identity is relative to the 
reference peptide. Thus, a peptide that is 50% identical 
10 to a reference polypeptide that is 100 amino acids long 
can be a 50 amino acid polypeptide that is completely 
identical to a 50 amino acid long portion of the 
reference polypeptide. It might also be a 100 amino acid 
long polypeptide which is 50% identical to the reference 
15 polypeptide over its entire length. Of course, many 
other polypeptides will meet the same criteria. 

The invention also encompasses antibodies and 
ant i- idiotypic antibodies (including Fab fragments) , 
antagonists and agonists of the ObR, as well as compounds 
20 or nucleotide constructs that inhibit expression of the 
obR gene (transcription factor inhibitors, antisense and 
ribozyme molecules, or gene or regulatory sequence 
replacement constructs) , or promote expression of ObR 
( e.g. , expression constructs in which obR coding 
25 sequences are operatively associated with expression 

control elements such as promoters, promoter /enhancers, 
etc.). The invention also relates to host cells and 
animals genetically engineered to express the human ObR 
(or mutants thereof) or to inhibit or "knock-out 11 
3 0 expression of the animal's endogenous ObR. 

The ObR proteins or peptides, ObR fusion proteins, 
obR nucleotide sequences, antibodies, antagonists and 
agonists can be useful for the detection of mutant ObRs 
or inappropriately expressed ObRs for the diagnosis of 
3 5 body weight disorders such as obesity, anorexia or 
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cachexia. The ObR proteins or peptides, ObR fusion 
proteins, obR nucleotide sequences, host cell expression 
systems, antibodies, antagonists, agonists and 
genetically engineered cells and animals can be used for 
5 screening for drugs effective in the treatment of such 
body weight disorders. The use of engineered host cells 
and/or animals may offer an advantage in that such 
systems allow not only for the identification of 
compounds that bind to the ECD of the ObR, but can also 

10 identify compounds that affect the signal transduced by 
the activated ObR. 

Finally, the ObR protein products (especially 
soluble derivatives such as peptides corresponding to the 
ObR ECD, or truncated polypeptides lacking the TM domain) 

15 and fusion protein products (especially ObR-Ig fusion 
proteins, i.e. . fusions of the ObR or a domain of the 
ObR, e.g. , ECD, ATM to an IgFc) , antibodies and anti- 
idiotypic antibodies (including Fab fragments) , 
antagonists or agonists (including compounds that 

20 modulate signal transduction which may act on downstream 
targets in the ObR signal transduction pathway) can be 
used for therapy of such diseases. For example, the 
administration of an effective amount of soluble ObR ECD, 
ATM ObR or an ECD-IgFc fusion protein or an anti- 

2 5 idiotypic antibody (or its Fab) that mimics the ObR ECD 

would "mop up" or "neutralize" endogenous Ob, and prevent 
or reduce binding and receptor activation, leading to 
weight gain. Nucleotide constructs encoding such ObR 
products can be used to genetically engineer host cells 

3 0 to express such ObR products in vivo ; these genetically 

engineered cells function as "bioreactors" in the body 
delivering a continuous supply of the ObR, ObR peptide, 
soluble ECD or ATM or ObR fusion protein that will "mop 
up" or neutralize Ob. Nucleotide constructs encoding 
3 5 functional ObRs, mutant ObRs f as well as antisense and 
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ribozyme molecules can be used in "gene therapy" 
approaches for the modulation of ObR expression and/or 
activity in the treatment of body weight disorders. 
Thus, the invention also encompasses pharmaceutical 
5 formulations and methods for treating body weight 
disorders. 

The invention is based, in part, on the surprising 
discovery of a high affinity receptor for Ob expressed at 
significant concentration in the choroid plexus. This 

10 discovery was made possible by using a novel alkaline 
phosphatase/Ob (AP-Ob) fusion protein for in situ 
staining of cells and tissue. Competition studies with 
unlabeled Ob confirmed that the in situ binding observed 
was specific for Ob. Murine obR cDNA was identified 

15 using AP-Ob fusion protein to screen an expression 

library of cDNAs synthesized from murine choroid plexus 
inRNA and transiently transfected into mammalian COS 
cells. A clone, famj5312, expressing the short form of a 
high affinity receptor for Ob was identified and 

2 0 sequenced. Sequence analysis revealed that the obR cDNA 

and predicted amino acid sequence are novel sequences 
containing amino acid regions indicating that ObR is a 
member of the Class I family of receptor proteins. 
Mapping studies described herein demonstrate that the obR 
25 gene maps to the db locus. The data presented herein 
demonstrate further that the sib gene is a mutant obR 
gene, which expresses an aberrantly spliced obR long form 
message that encodes a protein identical to the short 
form murine ObR. The famj5312 sequence was utilized to 

3 0 screen a human fetal brain cDNA library, which resulted 

in the identification of a human obR cDNA clone 
fahj5312d, described herein. oligonucleotide primers 
designed on the basis of the human cDNA sequence were 
used to clone the human genomic DNA clone, h-obR-p87 , 
3 5 also described herein. mRNA encoding the murine long 
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form of ObR was cloned from murine hypothalamus using 
degenerate primers designed on the human ObR cytoplasmic 
domain. 

Various aspects of the invention are described in 
5 greater detail in the subsections below. 

A. . THE ObR GENE 

The cDNA sequence ( SEQ . ID. No. 1) and deduced 
amino acid sequence (SEQ. ID. No. 2) of murine short form 
(894 amino acids long) and murine long form ObR are shown 

10 in FIGs. 1 and 6, respectively. The signal sequence of 
both murine short and long form ObR extends from amino 
acid residue 1 to about 22 of FIGs . 1 and 6, 
respectively; the extracellular domain of both forms of 
murine ObR extends from about amino acid residue number 

15 2 3 to about 837 of FIGs. 1 and 6; the transmembrane 
domain of both forms of murine ObR extends from about 
amino acid residue 838 to about 860 of FIGs. 1 and 6; and 
the cytoplasmic domain of the murine short form ObR 
extends from about amino acid residue 861 to 894 of FIG. 

2 0 1 while that of the long form extends from amino acid 

residue 861 to 1162 of FIG. 6. At least one other short 
form murine ObR has been identified which is one amino 
acid shorter ( i.e. . 89 3 amino acids) than the sequence 
shown in FIG. 1. The sequence at the C-terminus differs 
25 from the sequence shown in FIG. 1, in that residues 890 
- 894 (RTDTL) are not present; and instead, residues 890- 
893 of the second short form have the following sequence: 
IMWI. 

The cDNA sequence (SEQ. ID. No: 3) and deduced 

3 0 amino acid sequence (SEQ. ID. No: 4) of human ObR are 

shown in FIG. 3. The human ObR signal sequence extends 
from amino acid residue 1 to about 20 of FIG. 3; the 
extracellular domain of human ObR extends from about 
amino acid residue 21 to about 839 of FIG. 3; the 
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transmembrane domain of human ObR extends from about 
amino acid residue number 840 to about 862 of FIG. 3; and 
the cytoplasmic domain of human ObR extends from about 
amino acid residue number 863 to 1165 of FIG, 3. 
5 Sequences derived from the human cDNA clone were used to 
design primers that were used to clone the human genomic 
obR . h-obR-p87, as described in the examples, infra . 

Data presented in the working examples, infra , 
demonstrate that the obR gene maps to the db locus , and 
10 that the db gene is a mutant obR gene which is expressed 
in db mice as an aberrantly spliced transcript resulting 
in an mRNA species containing an insert of approximately 
106 nucleotides (nt) in the portion encoding the 
cytoplasmic domain of ObR. The insert produces a 
15 mutation that results in a transcript that encodes a 
prematurely truncated long form that is identical to 
murine short form ObR. 

The obR nucleotide sequences of the invention 
include: (a) the DNA sequence shown in FIG . l, 3 or 6 or 
20 contained in the cDNA clone famj5312 within E.coli strain 
5312B4F3 as deposited with the American Type Culture 
Collection (ATCC) , or contained in the cDNA clone 
fahj5312d within E. coli strain h-obRD as deposited with 
the ATCC, or contained in the human genomic clone, h-obR- 
25 p87 as deposited with the ATCC; (b) nucleotide sequence 
that encodes the amino acid sequence shown in FIG. 1, 3 
or 6, or the ObR amino acid sequence encoded by the cDNA 
clone famj5312 as deposited with the ATCC, or the cDNA 
clone fahj5312d as deposited with the ATCC, or contained 
3 0 in the human genomic clone, h-obR-p87 as deposited with 
the ATCC; (c) any nucleotide sequence that hybridizes to 
the complement of the DNA sequence shown in FIG. 1, 3 or 
6 or contained in the cDNA clone famj5312 as deposited 
with the ATCC, or contained in the cDNA clone fahj53l2d 
3 5 as deposited with the ATCC, or contained in the human 
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genomic clone, h-obR-p87 as deposited with the ATCC under 
highly stringent conditions, e,q. . hybridization to 
filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl 
sulfate (SDS) , 1 inM EDTA at 65°C, and washing in 
5 O.lxSSC/0.1% SDS at 68 °C (Ausubel F.M. et al., eds., 
1989, Current Protocols in Molecular Biology, Vol. I, 
Green Publishing Associates, Inc., and John Wiley & sons, 
Inc., New York, at p. 2.10.3) and encodes a functionally 
equivalent gene product; and (d) any nucleotide sequence 

10 that hybridizes to the complement of the DNA sequences 

that encode the amino acid sequence shown in FIG. 1, 3 of 
6 contained in cDNA clone famj5312 as deposited with the 
ATCC, or contained in the cDNA clone fahj5312d as 
deposited with the ATCC, or contained in the human 

15 genomic clone, h-obR-p87 as deposited with the ATCC under 
less stringent conditions, such as moderately stringent 
conditions, e.g. , washing in 0.2xSSC/0.1% SDS at 42 °C 
(Ausubel et al. , 1989, supra), yet which still encodes a 
functionally equivalent obR gene product. Functional 

2 0 equivalents of the ObR include naturally occurring ObR 

present in other species, and mutant ObRs whether 
naturally occurring or engineered. The invention also 
includes degenerate variants of sequences (a) through 
(d) . 

25 The invention also includes nucleic acid 

molecules, preferably DNA molecules, that hybridize to, 
and are therefore the complements of, the nucleotide 
sequences (a) through (d) , in the preceding paragraph. 
Such hybridization conditions may be highly stringent or 

3 0 less highly stringent, as described above. In instances 

wherein the nucleic acid molecules are 
deoxyoligonucleotides ("oligos"), highly stringent 
conditions may refer, e.g. . to washing in 6xSSC/0.05% 
sodium pyrophosphate at 37°C (for 14-base oligos), 48°C 
35 (for 17-base oligos), 55°C (for 20-base oligos), and 60°C 
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(for 2 3-base oligos) . These nucleic acid molecules may 
encode or act as obR antisense molecules, useful, for 
example, in obR gene regulation (for and/or as antisense 
primers in amplification reactions of obR gene nucleic 
5 acid sequences) . With respect to obR gene regulation, 
such techniques can be used to regulate, for example, 
cachexia and/or anorexia. Further, such sequences may be 
used as part of ribozyme and/or triple helix sequences, 
also useful for obR gene regulation. Still further, such 

10 molecules may be used as components of diagnostic methods 
whereby, for example, the presence of a particular obR 
allele responsible for causing a weight disorder, such as 
obesity, may be detected. 

In addition to the obR nucleotide sequences 

15 described above, full length obR cDNA or gene sequences 
present in the same species and/ or homologs of the obR 
gene present in other species can be identified and 
readily isolated, without undue experimentation, by 
molecular biological techniques well known in the art. 

2 0 The identification of homologs of obR in related species 
can be useful for developing animal model systems more 
closely related to humans for purposes of drug discovery. 
For example, expression libraries of cDNAs synthesized 
from choroid plexus mRNA derived from the organism of 

25 interest can be screened using labeled Ob derived from 
that species, e.g. . an AP-Ob fusion protein. 
Alternatively, such cDNA libraries, or genomic DNA 
libraries derived from the organism of interest can be 
screened by hybridization using the nucleotides described 

30 herein as hybridization or amplification probes. 

Furthermore, genes at other genetic loci within the 
genome that encode proteins which have extensive homology 
to one or more domains of the obR gene product can also 
be identified via similar techniques. In the case of 

35 cDNA libraries, such screening techniques can identify 
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clones derived from alternatively spliced transcripts in 
the same or different species. 

Screening can be by filter hybridization, using 
duplicate filters. The labeled probe can contain at 
5 least 15-3 0 base pairs of the a&B nucleotide sequence, as 
shown in FIG. l, 3 or 6. The hybridization washing 
conditions used should be of a lower stringency when the 
cDNA library is derived from an organism different from 
the type of organism from which the labeled sequence was 
10 derived. With respect to the cloning of a human obR 
homolog, using murine obR probes, for example, 
hybridization can, for example, be performed at 65 «c 
overnight in Church's buffer (7% SDS, 2 50 mM NaHP0 4 , 2(M 
EDTA, 1% BSA) . Washes can be done with 2XSSC, 0.1% SDS 
15 at 65 "C and then at 0.1XSSC, o.l% SDS at 65 °c. 

Low stringency conditions are well known to those 
of skill in the art, and will vary predictably depending 
on the specific organisms from which the library and the 
labeled sequences are derived. For guidance regarding 
such conditions see, for example, Sambrook et al. , 1989, 
Molecular Cloning, A Laboratory Manual, Cold Springs 
Harbor Press, N.Y. ; and Ausubel et al., 1989, Current 
Protocols in Molecular Biology, Green Publishing 
Associates and Wiley Interscience, N.Y. 

Alternatively, the labeled obR nucleotide probe 
may be used to screen a genomic library derived from the 
organism of interest, again, using appropriately 
stringent conditions. The identification and 
characterization of human genomic clones is helpful for 
3 0 designing diagnostic tests and clinical protocols for 
treating body weight disorders in human patients. For 
example, sequences derived from regions adjacent to the 
intron/exon boundaries of the human gene can be used to 
design primers for use in amplification assays to detect 
3 5 mutations within the exons, introns, splice sites ( e.g. 
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splice acceptor and/or donor sites), etc., that can be 
used in diagnostics. 

Further, an sfe£ gene homolog may be isolated from 
nucleic acid of the organism of interest by performing 
■ PCR using two degenerate oligonucleotide primer pools 
designed on the basis of amino acid sequences within the 
SbB gene product disclosed herein. The template for the 
reaction may be cDNA obtained by reverse transcription of 
mRNA prepared from, for example, human or non-human cell 
lines or tissue, such as choroid plexus, known or 
suspected to express an obR gene allele. 

The PCR product may be subcloned and sequenced to 
ensure that the amplified sequences represent the 
sequences of an obR gene. The PCR fragment may then be 
used to isolate a full length cDNA clone by a variety of 
methods. For example, the amplified fragment may be 
labeled and used to screen a cDNA library, such as a 
bacteriophage cDNA library. Alternatively, the labeled 
fragment may be used to isolate genomic clones via the 
screening of a genomic library. 

PCR technology may also be utilized to isolate 
full length cDNA sequences. For example, RNA may be 
isolated, following standard procedures, from an 
appropriate cellular or tissue source ( i.e. f one known, 
or suspected, to express the obR gene, such as, for 
example, choroid plexus or brain tissue) . a reverse 
transcription reaction may be performed on the RNA using 
an oligonucleotide primer specific for the most 5' end of 
the amplified fragment for the priming of first strand 
synthesis. The resulting RNA/DNA hybrid may then be 
"tailed" with guanines using a standard terminal 
transferase reaction, the hybrid may be digested with 
RNAase H, and second strand synthesis may then be primed 
with a poly-c primer. Thus, cDNA sequences upstream of 
the amplified fragment may easily be isolated. For a 



WO 97/19952 



PCT/US96/J9128 



- 29 - 



15 



review of cloning strategies which may be used, see e.g. 
Sambrook et al. , 1989, supra . 

The 0J2B gene sequences may additionally be used to 
isolate mutant obR gene alleles. Such mutant alleles may 
5 be isolated from individuals either known or proposed to 
have a genotype which contributes to the symptoms of body 
weight disorders such as obesity, cachexia or anorexia. 
Mutant alleles and mutant allele products may then be 
utilized in the therapeutic and diagnostic systems 
10 described below. Additionally, such obR gene sequences 
can be used to detect obR gene regulatory (e^, promoter 
or promotor/enhancer) defects which can affect body 
weight . 

A cDNA of a mutant obR gene may be isolated, for 
example, by using PCR, a technique which is well known to 
those of skill in the art. In this case, the first cDNA 
strand may be synthesized by hybridizing an oligo-dT 
oligonucleotide to mRNA isolated from tissue known or 
suspected to be expressed in an individual putatively 
carrying the mutant obR. allele, and by extending the new 
strand with reverse transcriptase. The second strand of 
the cDNA is then synthesized using an oligonucleotide 
that hybridizes specifically to the 5' end of the normal 
gene. Using these two primers, the product is then 
amplified via PCR, cloned into a suitable vector, and 
subjected to DNA sequence analysis through methods well 
known to those of skill in the art. By comparing the DNA 
sequence of the mutant obR allele to that of the normal 
2bR allele, the mutation (s) responsible for the loss or 
alteration of function of the mutant obR. gene product can 
be ascertained. 

Alternatively, a genomic library can be 
constructed using DNA obtained from an individual 
suspected of or known to carry the mutant obR allele, or 
a cDNA library can be constructed using RNA from a tissue 
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known, or suspected, to express the mutant obR allele. 
The normal obR gene or any suitable fragment thereof may 
then be labeled and used as a probe to identify the 
corresponding mutant obR allele in such libraries. 
5 Clones containing the mutant obR gene sequences may then 
be purified and subjected to sequence analysis according 
to methods well known to those of skill in the art. 

Additionally, an expression library can be 
constructed utilizing cDNA synthesized from, for example, 
10 RNA isolated from a tissue known, or suspected, to 

express a mutant ob£ allele in an individual suspected of 
or known to carry such a mutant allele. In this manner, 
gene products made by the putatively mutant tissue may be 
expressed and screened using standard antibody screening 
15 techniques in conjunction with antibodies raised against 
the normal obR gene product, as described, below, in 
Section 5.3. (For screening techniques, see, for 
example, Harlow, E . and Lane, eds., 1988, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Press, Cold Spring 
20 Harbor.) Additionally, screening can be accomplished by 
screening with labeled Ob fusion proteins, such as, for 
example, AP-Ob or Ob-AP fusion proteins. In cases where 
an obR mutation results in an expressed gene product with 
altered function (e^, as a result of a missense or a 
25 frameshift mutation) , a polyclonal set of antibodies to 
ObR are likely to cross-react with the mutant ObR gene 
product. Library clones detected via their reaction with 
such labeled antibodies can be purified and subjected to 
sequence analysis according to methods well known to 
30 those of skill in the art. 

The invention also encompasses nucleotide 
sequences that encode mutant ObRs, peptide fragments of 
the ObR, truncated ObRs, and ObR fusion proteins. These 
include, but are not limited to nucleotide sequences 
3 5 encoding mutant ObRs described in section 5.2 infra; 
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polypeptides or peptides corresponding to the ECD, TM 
and/ or CD domains of the ObR or portions of these 
domains; truncated ObRs in which one or two of the 
domains is deleted, e^, a soluble ObR lacking the TM or 
both the TM and CD regions, or a truncated, nonfunctional 
ObR lacking all or a portion of the CD region. 
Nucleotides encoding fusion proteins may include by are 
not limited to full length ObR, truncated ObR or peptide 
fragments of ObR fused to an unrelated protein or 
peptide, such as for example, a transmembrane sequence, 
which anchors the ObR ECD to the cell membrane; an Ig Fc 
domain which increases the stability and half life of the 
resulting fusion protein (e^, 6bR-Ig) in the 
bloodstream; or an enzyme, fluorescent protein, 
luminescent protein which can be used as a marker. 

The invention also encompasses (a) DNA vectors 
that contain any of the foregoing ObR coding sequences 
and/or their complements (i^, antisense) ; (b) DNA 
expression vectors that contain any of the foregoing ObR 
coding sequences operatively associated with a regulatory 
element that directs the expression of the codxng 
sequences; and (c) genetically engineered host cells that 
contain any of the foregoing ObR coding sequences 
operatively associated with a regulatory element that 
directs the expression of the coding sequences in the 
host cell. As used herein, regulatory elements include 
but are not limited to inducible and non-inducible 
promoters, enhancers, operators and other elements known 
to those skilled in the art that drive and regulate 
30 expression. such regulatory elements include but are not 
limited to the cytomegalovirus hCMV immediate early gene, 
the early or late promoters of SV40 adenovirus, the lac 
system, the t£i> system, the TAG system, the TRC system, 
the major operator and promoter regions of phage A, the 
35 control regions of fd coat protein, the promoter for 
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3-phosphoglycerate kinase, the promoters of acid 
phosphatase, and the promoters of the yeast a-mating 
factors. 

B . . r.v.T? PBOTE TMB AMD pn T.YPBPTIDBS 

5 ObR protein, polypeptides and peptide fragments, 

mutated, truncated or deleted forms of the ObR and/or ObR 
fusion proteins can be prepared for a variety of uses, 
including but not limited to the generation of 
antibodies, as reagents in diagnostic assays, the 

10 identification of other cellular gene products xnvolved 
in the regulation of body weight, as reagents in assays 
for screening for compounds that can be used in the 
treatment of body weight disorders, and as pharmaceutical 
reagents useful in the treatment of body weight disorders 

15 related to the ObR. 

FIGs. 1 and 6 show the amino acid sequence of a 

murine short form and long form ObR protein, 

respectively. In both of these forms of ObR, the signal 

sequence extends from amino acid 1 to about 22; the ECD 

20 extends from about amino acid 23 to about 837; and the TK 

extends from about amino acid 838 to about 860. In the 

short form of murine ObR, the CD extends from about amino 

acid 861 to about 894 (or to 893 in the second short 

form) , while in the long form it extends from about ammo 

25 acid 861 to 1162. FIG. 3 shows the amino acid sequence 

of a human ObR. The signal sequence extends from ammo 

acid residue 1 to about 20; the ECD extends from about 

amino acid residue 21 to about 839; the TM extends from 

about amino acid residue 840 to about 862; and the CD 

30 extends from about amino acid residue 863 to 1165. 

The ObR sequence begins with a methionine m a DNA 

sequence context consistent with a translation initiation 

site, followed by a typical hydrophobic signal sequence 

of peptide secretion. The predicted mature extracellular 
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domain for both forms of murine ObR is identical and is 
815 amino acids long, whereas the ECD predicted for human 
ObR is 819 amino acids long. The extracellular domain of 
ObR shows many features of the class I cytokine receptor 
5 family (reviewed in Heldin, 1995, Cell 80:213-223), and 
is most closely related to the gpl3 0 signal transducing 
component of the IL-6 receptor (Taga et al., 1989, Cell 
58:573-581), the G-CSF receptor (Fukunaga et al., 1990, 
Cell 61:341-350), and the LIF receptor (Gearing et al . , 

10 1991, Science 255:1434-1437). In fact, the data 

presented herein demonstrate that the long form ObR 
signals through activation of STAT proteins — a hallmark 
of the signal transduction pathway mediated by the IL-6 
type cytokine receptor family. 

15 An alignment between the extracellular domains of 

the murine ObR and gpl30 is shown in FIG. 4. Although 
the overall amino acid sequence identity between these 
two molecules is low (24%) , the characteristically 
conserved cysteine residues, the Trp-Ser-X-Trp-Ser motif 

2 0 (amino acid residues 317-321 and 620-624 in the murine 

sequence shown in FIG. 1; amino acid residues 319-323 and 
62 2-62 6 in the human sequence shown in FIG. 3) , and 
conservation of other residues within this group of 
proteins (reviewed in Kishimoto et al., 1994, Cell 
25 76:253-262) is clearly evident. The amino acid sequences 
of murine short form ObR and human ObR are highly 
homologous throughout the length of murine short form ObR 
(FIG . 5) . In fact, the deduced amino acid sequence 
identity between the murine short form and human clones 

3 0 (78%) is the same or greater than that seen when 

comparing the murine and human forms of gpl30 (Saito et 
al., 1992, J. Immunol. 148:4066-4071), the LIF receptor 
(Gough et al., 1988, Proc. Natl. Acad. Sci . 85:2623- 
2627), and the G-CSF receptor (Fukanaga et al., 1990, 
35 Proc. Natl. Acad. Sci. 87:8702-8706). Similarly, the 
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deduced amine acid sequences from murine and human long 
XL* or Oh* are homologous throughout, the length of the 
coding region and share 75% identity (FIG. 7) 

Potential N-linked glycosylate sites U^, 

k-y-s or N-X-T) are found in 

the ECU ol ^„ c i*. es ca n be identified 

^. . . xi- 1 inked qlycosylation sites can 
potential M shown in FIGs. 1 and 6 

in the murine OfcR ECD £ amino acid residueS 

(see tripep-ae mot, *- ^ 34? , 3 97 , 433, 

L0 23, 41, 56. 73 81, . . whereas at 

516, 624, 659, 67 0 688 697 , ^ can 

ieas : irrd z h EC : u«. * ™. 

be identified in th • at amin o acid residues 

3 .see -^^r ^, 3 X 431, 514, 622, 657, 66S , 
15 41, 56, 73, 98, * 75 ' extrace llular domain of both 

transmembrane of « « ld5 - OTcoaes . 5hort 

The murine =DH» shown in KG. 1 4 ^ 

„ cytopi.smic domain „4 — »=f n S ;. e Ml Mlno «id 
the murine OK cytopiasmic domarn U—- ^ 

residues 865 to 8B4 u. FM. l> -» in «, cellular domain 

of the UF receptor, and ^ ^ ^ 

25 sequence (NarazaKi e •' cDNA encodes a 

protein with a -J^^^. -ine short for. 
murine -^^^ Cains are hiohiy conserved up 
and human intraceliuia _. rln . short form obR, the 

30 to the final five residues of™ s 

human ^"^^^.TU--.. »' - 

to that of g P i30. The nu simi i a r 

Short form and ^ »™ «' 

throughout the coding region 
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but then diverge completely near the murine short form 

ObR stop codon. 

The short cytoplasmic domain of the murine short 
form cDNAs described herein is characteristic of several 
5 class I cytokine receptor polypeptides (reviewed in 

Kishimoto et al. . 1994. Cell 76:253-262). However the 
results reported herein demonstrate that the short form 
ObR does not activate signal transduction via the STAT 
pathway which is mediated by the long form ObR. In fact, 
10 the three receptors to which ObR shows the strongest 

homology all have long cytoplasmic domaxns important in 
intracellular signaling. This opened the possxbxlxty 
that the murine short form ObR clone isolated was 
chimeric or encoded a rare aberrantly spliced form not 
15 representing the major form expressed within the choroxd 
Plexus. To address this issue, eight murine clones were 
selected that were independently identified in the 
library screen, and each was amplified (xn subpools of 
X S0 clones each) by PCR with primers made to sequences 3 
20 of the stop codon. Results verified that all exght 

clones contained these same 3' untranslated sequences, 
in addition, the C-terminus of five independently 
isolated clones was sequenced and all shown to have the 
same stop codon. Finally, reverse transcription PCR wxth 
25 total RNA from choroid plexus isolated from a 

strain (C57Bl/KsJ) different from that whxch the cDNA 
library was derived, generated an identical PCR product _ 
containing a stop codon in the same location- ™Z\T 
indicated that the isolated murine short form clone xs 
30 neither chimeric nor a rare aberrant splice event but 
rather is likely to be the predominant form of thxs 
receptor in the murine choroid plexus. The data 
presented herein indicate that in some tissues, 
al ternatively spliced forms of mouse ObR exxst wxth 
35 longer intracellular domains (the long form) , 
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wild-type obR gene is expressed in two forms, one mRNA 
transcript having an insert of about 100 nucleotides 
encodes ObR having a short cytoplasmic domain, and 
another mRNA transcript encodes ObR having a long 
5 cytoplasmic domain that is homologous to the human CD. 

The murine cDNA shown in FIG . 6 encodes the long 
form ObR. As described supra, the amino acids encoding 
the ECD and TM of the murine long form ObR are identical 
to those for the murine short form. The murine long form 
10 cDNA, however, encodes a cytoplasmic domain (3 02 amino 
acids) that is approximately the same length as the 
cytoplasmic domain encoded by the human QbB. cDNA. Unlike 
the ObR short forms, the ObR encoded by the nucleotide 
sequence of the murine long form continues to be similar 
15 to that of the human ObR throughout the cytoplasmic 
domain. 

The data presented herein also indicate that db is 
a mutant of the long form murine sfeR gene. The db mutant 
expresses an aberrantly spliced transcript containing an 
20 insert of about 106 nucleotides in the portion of the 
MRNA encoding the CD. Although the transcript is long, 
the inserted sequence produces a mutation that results in 
a transcript that encodes a truncated ObR protein that is 
identical to the short forms of ObR and therefore, lacks 
25 most of the CD. The data shown herein demonstrate that, 
unlike the long form ObR, the short form ObR, i^, the 
form of the receptor associated with the obese phenotype 
in db/db mice, does not transduce signal mediated by the 
STAT pathway. Therefore, it appears that the signalling- 
30 competent long form ObR is actively involved in body 
weight regulation and maintenance. 

in sum, messenger RNA for several major ObR forms 
have been identified. The predominant ObR mRNA found in 
most tissues encodes a transmembrane protein with a short 
cytoplasmic domain of 34 amino acid residues referred to 



35 



WO 97/19952 



PCT/US96/19128 



- 37 - 

as the short form. In hypothalamus, an obR mRNA exists 
which encodes a protein with an identical extracellular 
domain as the short form, but with a 3 02 residue-long 
cytoplasmic domain, referred to as the long form. The db 
5 mutation leads to the production of an aberrant splice 
product of long form transcript, resulting in a protein 
with truncated cytoplasmic domain. Interestingly, the 
mRNA for the long form of ObR in the db/db mice encodes a 
protein with an identical structure to the naturally 
10 occurring short form. The loss of this carboxyterminal 
region is proposed to render the ObR inactive and is 
predicted to generate the obese phenotype in db/db mice. 

Sequence information indicated that ObR might 
exert a signaling action similar to that of G-CSFR, LIFR 
15 and gpl30 (Stahl & Yancopoulos, 1993, Cell 74:587-590; 

Kishimoto, et al., 1995, Blood 86:1243-1254). Signaling 
by these receptors entails, among others, the activation 
of receptor-associated kinases of the Janus kinase family 
which contribute to the phosphorylation and activation of 
20 the DNA binding activity of STAT1, STAT 3 and STATS (Ihle, 
1995, Nature 377:591-594; Kishimoto, et al . , 1995, Blood 
86:1243-1254). This process, in turn, has been 
correlated with induced transcription of genes that 
contain binding sites for the STAT proteins such as the 

2 5 hepatic genes encoding acute phase plasma proteins (Lai 

et al., 1995, J. Biol. Chera. 270:23254-23257). To 
address whether the cloned ObR isoforms are indeed 
signaling receptor molecules, ObR was introduced into 
established tissue culture cell lines and the cell 

3 0 response to OB treatment was compared with that mediated 

by the structurally-related IL-6-type cytokine receptors. 
The results presented in the example inf ra demonstrate 
that the long form ObR is a signal-transducing molecule. 
In particular, the results show that the long form ObR 
3 5 shares functional specificity with IL-6-type cytokine 
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receptors. The results also show that the short form ObR 
does not signal via the STAT pathway transduced by the 
ObR long form. Thus, it appears that the long form ObR, 
but not the short form, is involved in maintenance of 
5 body weight. 

The ObR amino acid sequences of the invention 
include the amino acid sequence shown in FIG. 1 (SEQ. ID. 
No: 2), FIG. 3 (SEQ. ID. No: 4) or FIG. 6, or the amino 
acid sequence encoded by cDNA clone famj5312 as deposited 
10 with the ATCC, or encoded by cDNA clone fahj5312d as 

deposited with the ATCC, or encoded by the human genomic 
clone h-obR-p87, as deposited with the ATCC. Further, 
ObRs of other species are encompassed by the invention. 
In fact, any ObR protein encoded by the obR nucleotide 
15 sequences described in Section 5.1, above, are within the 
scope of the invention. 

The invention also encompasses proteins that are 
functionally equivalent to the ObR encoded by the 
nucleotide sequences described in Section 5.1, as judged 
20 by any of a number of criteria, including but not limited 
to the ability to bind Ob, the binding affinity for Ob, 
the resulting biological effect of Ob binding, e.g. , 
signal transduction, a change in cellular metabolism 
( e.g. . ion flux, tyrosine phosphorylation) or change in 
25 phenotype when the ObR equivalent is present in an 
appropriate cell type (such as the amelioration, 
prevention or delay of the obese phenotype, i.e. , the db 
or ob phenotype) , or weight loss. Such functionally 
equivalent ObR proteins include but are not limited to 
3 0 additions or substitutions of amino acid residues within 
the amino acid sequence encoded by the obR nucleotide 
sequences described, above, in Section 5.1, but which 
result in a silent change, thus producing a functionally 
equivalent gene product. Amino acid substitutions may be 
35 made on the basis of similarity in polarity, charge, 
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solubility, hydrophobics, hydrophilicity , and/or the 
amphipathic nature of the residues involved. For 
example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, 
5 phenylalanine, tryptophan, and methionine; polar neutral 
amino acids include glycine, serine, threonine, cystexne, 
tyrosine, asparagine, and glutamine; positively charged 
(basic) amino acids include arginine, lysine, and 
histidine; and negatively charged (acidic) amino acxds 
10 include aspartic acid and glutamic acid. 

While random mutations can be made to ob£ DMA 
(using random mutagenesis techniques well known to those 
skilled in the art) and the resulting mutant ObRs tested 
for activity, site-directed mutations of the obR codxng 
IS sequence can be engineered (using site-directed 

mutagenesis techniques well known to those skilled xn the 
art) to generate mutant ObRs with increased functxon, 
e , q ., higher binding affinity for Ob, and/or greater 
signalling capacity; or decreased function, e^., lower 
20 binding affinity for Ob, and/or decreased sxgnal 
transduction capacity. 

For example, the alignment of mouse short form 
{FIG 1) and the human ObR homolog (FIG. 3) is shown xn 
FIG. 5 in which identical amino acid residues are 
25 indicated by a star. Mutant ObRs can be engineered so 
that regions of identity (indicated by stars xn FIG. 5) 
are maintained, whereas the variable residues (unstarred 
in FIG. 5) are altered, *^u. by deletion or insertxon of 
an amino acid residue(s) or by substitution of one or 
30 more different amino acid residues. Conservatxve 

alterations at the variable positions can be engxneer d 
in order to produce a mutant ObR that retaxns functxon, 
Ob binding affinity or signal transduction 
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capability or both. Mo-conservative changes can be 
IZtlH at these variable positions to alter function, 
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e.g. , Ob binding affinity or signal transduction 
capability, or both. Alternatively, where alteration of 
function is desired, deletion or non-conservative 
alterations of the conserved regions (i.e. , identical 
5 amino acids indicated by stars in FIG. 5) can be 

engineered. For example, deletion or non-conservative 
alterations (substitutions or insertions) of the CD, 
e.g. , amino acid residues 861-894 (FIG. 1) of murine ObR, 
or amino acid residues 863-1165 (FIG. 3) of human ObR, or 
10 portions of the CD, e.g. . amino acid residues 861-884 
(FIG. 1) of murine ObR, or amino acid residues 863-886 
(FIG. 3) of human ObR (the box 1 Jak interaction domain) 
can be engineered to produce a mutant ObR that binds Ob 
but is signalling-incompetent. Non-conservative 
15 alterations to the starred residues in the ECD shown in 
FIG. 5 can be engineered to produce mutant ObRs with 
altered binding affinity for Ob. The same mutation 
strategy can also be used to design mutant ObRs based on 
the alignment of murine long ObR form and the human ObR 
20 homolog shown in FIG. 7 in which identical amino acid 
residues are indicated by a double asterisk. 

FIG. 4 shows the alignment of the ECD of murine 
ObR with human gpl30, in which identical residues are 
indicated in black, and conservative changes are 
25 indicated in grey. Presumably, regions of identity and 
conservation are important for maintaining tertiary 
structure of the ECD, whereas the variable regions may 
contribute to specificity of each receptor for its 
ligand. Therefore, ObR mutants with altered binding 
3 0 affinity for Ob may be engineered by altering the 

variable regions shown in FIG. 4. Such ObR mutants can 
be designed so as to preserve the ObR amino acid 
sequences that are boxed in FIG. 4 (both black and grey 
boxes) or to contain one or more conservative 
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substitutions derived from the gpl3 0 sequence shown in 
the grey boxes of FIG. 4. 

Other mutations to the obR coding sequence can be 
made to generate ObRs that are better suited for 
expression, scale up, etc. in the host cells chosen. For 
example, cysteine residues can be deleted or substituted 
with another amino acid in order to eliminate disulfide 
bridges; N-linked glycosylation sites can be altered or 
eliminated to achieve, for example, expression of a 
homogeneous product that is more easily recovered and 
purified from yeast hosts which are known to 
hyperglycosylate N-linked sites. To this end, a variety 
of amino acid substitutions at one or both of the first 
or third amino acid positions of any one or more of the 
15 glycosylation recognition sequences which occur in the 
ECD (N-X-S or N-X-T) , and/ or an amino acid deletion at 
the second position of any one or more such recognition 
sequences in the ECD will prevent glycosylation of the 
ObR at the modified tripeptide sequence. (See, e^o,., 
Miyajima et al. , 1986, EMBO J * 5 ( 6) : 1193-1197 ) 



10 



20 



Peptides corresponding to one or more domains of 
the ObR ( e.g. . ECD, TM or CD), truncated or deleted ObRs 
( e.g. . ObR in which the TM and/or CD is deleted) as well 
as fusion proteins in which the full length ObR, an ObR 
25 peptide or truncated ObR is fused to an unrelated protein 
are also within the scope of the invention and can be 
designed on the basis of the obR nucleotide and ObR amino 
acid sequences disclosed in this Section and in 
Section 5.1, above. Such fusion proteins include but are 
3 0 not limited to igFc fusions which stabilize the ObR 
protein or peptide and prolong half-life in vivo ; or 
fusions to any amino acid sequence that allows the fusion 
protein to be anchored to the cell membrane, allowing the 
ECD to be exhibited on the cell surface; or fusions to an 
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enzyme, fluorescent protein, or luminescent protein which 
provide a marker function. 

While the ObR polypeptides and peptides can be 
chemically synthesized (e.g. , see Creighton, 1983, 
5 Proteins: Structures and Molecular Principles, W.H. 

Freeman & Co., N.Y.), large polypeptides derived from the 
ObR and the full length ObR itself may advantageously be 
produced by recombinant DNA technology using techniques 
well known in the art for expressing nucleic acid 
10 containing obR gene sequences and/or coding sequences. 

Such methods can be used to construct expression vectors 
containing the obR nucleotide sequences described in 
Section 5.1 and appropriate transcriptional and 
translational control signals. These methods include, 
15 for example, in vitro recombinant DNA techniques, 

synthetic techniques, and in vivo genetic recombination. 
See, for example, the techniques described in Sambrook et 
al ' 1989, supra , and Ausubel et al. , 1989, supra. 
Alternatively, RNA capable of encoding obR nucleotide 
20 sequences may be chemically synthesized using, for 

example, synthesizers. See, for example, the techniques 
described in "Oligonucleotide synthesis", 1984, Gait, 
M.J. ed., IRL Press, Oxford, which is incorporated by 
reference herein in its entirety. 
25 A variety of host-expression vector systems may be 

utilized to express the obR nucleotide sequences of the 
invention. Where the ObR peptide or polypeptide is a 
soluble derivative (e^, ObR peptides corresponding to 
the ECD; truncated or deleted ObR in which the TM and/or 
CD are deleted) the peptide or polypeptide can be 
recovered from the culture, i^, from the host cell xn 
cases where the ObR peptide or polypeptide is not 
secreted, and from the culture media in cases where the 
ObR peptide or polypeptide is secreted by the cells. 
However, the expression systems also encompass engineered 
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host cells that express the ObR or functional equivalents 
in situ , i.e. . anchored in the cell membrane. 
Purification or enrichment of the ObR from such 
expression systems can be accomplished using appropriate 
5 detergents and lipid micelles and methods well known to 
those skilled in the art. However, such engineered host 
cells themselves may be used in situations where it is 
important not only to retain the structural and 
functional characteristics of the ObR, but to assess 
10 biological activity, e.g. . in drug screening assays. 

The expression systems that may be used for 
purposes of the invention include but are not limited to 
microorganisms such as bacteria (e^^, E. coli , &^ 
subtilis) transformed with recombinant bacteriophage DNA, 
15 plasmid DNA or cosmid DNA expression vectors containing 
ob R nucleotide sequences; yeast (e^, Saccharomvces , 
Pichia) transformed with recombinant yeast expression 
vectors containing the obR nucleotide sequences; insect 
cell systems infected with recombinant virus expression 
20 vectors (e^, baculovirus) containing the obR sequences; 
plant cell systems infected with recombinant virus 
expression vectors (e^, cauliflower mosaic virus, CaMV; 
tobacco mosaic virus, TMV) or transformed with 
recombinant plasmid expression vectors (a^, Ti plasmid) 
25 containing obR nucleotide sequences; or mammalian cell 
systems (e^, COS, CHO, BHK, 293, 3T3) harboring 
recombinant expression constructs containing promoters 
derived from the genome of mammalian cells ( e.q, , 
metallothionein promoter) or from mammalian viruses 
f e.a. . the adenovirus late promoter; the vaccinia virus 

7 . 5K promoter) . 

in bacterial systems, a number of expression 
vectors may be advantageously selected depending upon the 
use intended for the obR gene product being expressed. 
For example, when a large quantity of such a protein is 
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to be produced, for the generation of pharmaceutical 
compositions of ObR protein or for raising antibodies to 
the ObR protein, for example, vectors which direct the 
expression of high levels of fusion protein products that 
5 are readily purified may be desirable- Such vectors 
include, but are not limited, to the E ■ coli expression 
vector pUR278 (Ruther et al., 1983, EMBO J. 2:1791), in 
which the obR coding sequence may be ligated individually 
into the vector in frame with the lacZ coding region so 

10 that a fusion protein is produced; pIN vectors (Inouye & 
Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke 
& Schuster, 1989, J. Biol. Chem. 264:5503-5509); and the 
like, pGEX vectors may also be used to express foreign 
polypeptides as fusion proteins with glutathione 

15 S-transf erase (GST) . In general, such fusion proteins 
are soluble and can easily be purified from lysed cells 
by adsorption to glutathione-agarose beads followed by 
elution in the presence of free glutathione. The PGEX 
vectors are designed to include thrombin or factor Xa 

2 0 protease cleavage sites so that the cloned target gene 

product can be released from the GST moiety. 

In an insect system, Autoaraoha californica 
nuclear polyhidrosis virus (AcNPV) is used as a vector to 
express foreign genes. The virus grows in Soodoptera 

25 f ruqiperda cells. The obR gene coding sequence may be 
cloned individually into non-essential regions (for 
example the polyhedrin gene) of the virus and placed 
under control of an AcNPV promoter (for example the 
polyhedrin promoter) . Successful insertion of obR gene 

30 coding sequence will result in inactivation of the 
polyhedrin gene and production of non-occluded 
recombinant virus (i.e., virus lacking the proteinaceous 
coat coded for by the polyhedrin gene) . These 
recombinant viruses are then used to infect Spodoptera 

3 5 frugiperda cells in which the inserted gene is expressed. 
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(E.g. . see Smith et al., 1983 , J. Virol. 46: 584; Smith, . 
U.S. Patent No. 4,215,051). 

In mammalian host cells, a number of viral-based 
expression systems may be utilized. In cases where an 
5 adenovirus is used as an expression vector, the obR 
nucleotide sequence of interest may be ligated to an 
adenovirus transcription/translation control complex, 
e . a ., the late promoter and tripartite leader sequence. 
This chimeric gene may then be inserted in the adenovirus 
10 genome by in vitro or in vivo recombination. Insertion 
in a non-essential region of the viral genome (e.g. , 
region El or E3) will result in a recombinant virus that 
is viable and capable of expressing the obR gene product 
in infected hosts. ( E.g. , See Logan & Shenk, 1984, Proc. 
15 Natl. Acad. Sci. USA 81:3655-3659). Specific initiation 
signals may also be required for efficient translation of 
inserted obR nucleotide sequences. These signals include 
the ATG initiation codon and adjacent sequences. In 
cases where an entire obR gene or cDNA, including its own 
20 initiation codon and adjacent sequences, is inserted into 
the appropriate expression vector, no additional 
translational control signals may be needed. However, in 
cases where only a portion of the obR coding sequence is 
inserted, exogenous translational control signals, 

2 5 including, perhaps, the ATG initiation codon, must be 

provided. Furthermore, the initiation codon must be in 
phase with the reading frame of the desired coding 
sequence to ensure translation of the entire insert. 
These exogenous translational control signals and 

3 0 initiation codons can be of a variety of origins, both 

natural and synthetic. The efficiency of expression may 
be enhanced by the inclusion of appropriate transcription 
enhancer elements, transcription terminators, etc. (See 
Bittner et al., 1987, Methods in Enzymol. 153:516-544). 
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In addition, a host cell strain may be chosen 
which modulates the expression of the inserted sequences, 
or modifies and processes the gene product in the 
specific fashion desired. Such modifications ( e.g. . 
5 glycosylation) and processing ( e.g. , cleavage) of protein 
products may be important for the function of the 
protein. Different host cells have characteristic and 
specific mechanisms for the post-translational processing 
and modification of proteins and gene products, 

10 Appropriate cell lines or host systems can be chosen to 
ensure the correct modification and processing of the 
foreign protein expressed. To this end, eukaryotic host 
cells which possess the cellular machinery for proper 
processing of the primary transcript, glycosylation, and 

15 phosphorylation of the gene product may be used. Such 
mammalian host cells include but are not limited to CHO, 
VERO, BHK, HeLa, COS, MDCK, 293, 3T3 , WI38, and in 
particular, choroid plexus cell lines. 

For long-term, high-yield production of recombi- 

20 nant proteins, stable expression is preferred. For 
example, cell lines which stably express the obR 
sequences described above may be engineered. Rather than 
using expression vectors which contain viral origins of 
replication, host cells can be transformed with DNA 

2 5 controlled by appropriate expression control elements 

( e.g. . promoter, enhancer sequences, transcription 
terminators, polyadenylation sites, etc.), and a 
selectable marker. Following the introduction of the 
foreign DNA, engineered cells may be allowed to grow for 

3 0 1-2 days in an enriched media, and then are switched to a 

selective media. The selectable marker in the 
recombinant plasmid confers resistance to the selection 
and allows cells to stably integrate the plasmid into 
their chromosomes and grow to form foci which in turn can 
3 5 be cloned and expanded into cell lines. This method may 
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advantageously be used to engineer cell lines which 
express the obB gene product. Such engineered cell lines 
may be particularly useful in screening and evaluation of 
compounds that affect the endogenous activity of the obR 

gene product. 

A number of selection systems may be used, 
including but not limited to the herpes simplex virus 
thymidine kinase (Wigler, et al., 1977, Cell 11:223), 
hypoxanthine-guanine phosphor ibosy transferase (Szybalska 
& Szybalski, 1962, Proc. Natl. Acad. Sci. USA 48:2026), 
and adenine phosphoribosyltransf erase (Lowy, et al., 
1980, Cell 22:817) genes can be employed in tk~, hgprt or 
aprt" cells, respectively. Also, antimetabolite 
resistance can be used as the basis of selection for the 
15 following genes: dhfr, which confers resistance to 

methotrexate (Wigler, et al. , 1980, Natl. Acad. Sci. USA 
77:3567; O'Hare, et al., 1981, Proc. Natl. Acad. Sci. USA 
78-1527); gpt, which confers resistance to mycophenolic 
acid (Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 
20 78:2072); neo , which confers resistance to the 

aminoglycoside G-418 (Colberre-Garapin, et al. , 1981, J. 
Mol. Biol. 150:1); and hygro, which confers resistance to 
hygromycin (Santerre, et al., 1984, Gene 30:147). 

Alternatively, any fusion protein may be readily 
25 purified by utilizing an antibody specific for the fusion 
protein being expressed. For example, a system described 
by Janknecht et al. allows for the ready purification of 
non-denatured fusion proteins expressed in human cell 
lines (Janknecht, et al. , 1991, Proc. Natl. Acad. Sci. 
30 USA 88: 8972-8976). In this system, the gene of interest 
is subcloned into a vaccinia recombination plasmid such 
that the gene's open reading frame is translationally 
fused to an amino-terminal tag consisting of six 
histidine residues. Extracts from cells infected with 
35 recombinant vaccinia virus are loaded onto 
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Ni 2 + * nitriloacetic acid-agarose columns and histidine- 
tagged proteins are selectively eluted with imidazole- 
containing buffers. 

The obR gene products can also be expressed in 
5 transgenic animals. Animals of any species, including, 
but not limited to, mice, rats, rabbits, guinea pigs, 
pigs, micro-pigs, goats, and non-human primates, e.g. , 
baboons, monkeys, and chimpanzees may be used to generate 
obR transgenic animals. 
!0 Any technique known in the art may be used to 

introduce the obR transgene into animals to produce the 
founder lines of transgenic animals. Such techniques 
include, but are not limited to pronuclear microinjection 
(Hoppe, P.C. and Wagner, -33T.E. P 1989, U.S. Pat. No. 
15 4,873,191); retrovirus mediated gene transfer into germ 
lines (Van der Putten et al. , 1985, Proc. Natl. Acad. 
Sci., USA 82:6148-6152); gene targeting in embryonic stem 
cells (Thompson et al. , 1989, Cell 56:313-321); 
electroporation of embryos (Lo f 1983, Mol Cell. Biol. 
20 3:1803-1814); and sperm-mediated gene transfer (Lavitrano 
et al., 1989, Cell 57:717-723); etc. For a review of 
such techniques, see Gordon, 1989, Transgenic Animals, 
Intl. Rev. cytol. 115:171-229, which is incorporated by 
reference herein in its entirety. 
25 The present invention provides for transgenic 

animals that carry the pbE transgene in all their cells, 
as well as animals which carry the transgene in some, but 
not all their cells, i.e. , mosaic animals. The transgene 
may be integrated as a single transgene or in 
30 concatamers, e.g. . head-to-head tandems or head-to-tail 
tandems. The transgene may also be selectively 
introduced into and activated in a particular cell type 
by following, for example, the teaching of Lasko et al. 
(Lasko, M. et al. , 1992, Proc. Natl. Acad. Sci. USA 89: 
35 6232-6236). The regulatory sequences required for such a 
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cell-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to 
those of skill in the art. When it is desired that the 
obR gene transgene be integrated into the chromosomal 
5 site of the endogenous obR gene, gene targeting is 
preferred. Briefly, when such a technique is to be 
utilized, vectors containing some nucleotide sequences 
homologous to the endogenous obR gene are designed for 
the purpose of integrating, via homologous recombination 

.0 with chromosomal sequences, into and disrupting the 

function of the nucleotide sequence of the endogenous obR 
gene. The transgene may also be selectively introduced 
into a particular cell type, thus inactivating the 
endogenous obR gene in only that cell type, by following, 

.5 for example, the teaching of Gu et al. (Gu, et al. , 1994, 
Science 265: 103-106) . The regulatory sequences required 
for such a cell-type specific inactivation will depend 
upon the particular cell type of interest, and will be 
apparent to those of skill in the art. 

0 Once transgenic animals have been generated, the 

expression of the recombinant obR gene may be assayed 
utilizing standard techniques. Initial screening may be 
accomplished by Southern blot analysis or PGR techniques 
to analyze animal tissues to assay whether integration of 

5 the transgene has taken place. The level of mRNA 
expression of the transgene in the tissues of the 
transgenic animals may also be assessed using techniques 
which include but are not limited to Northern blot 
analysis of tissue samples obtained from the animal, in 

0 situ hybridization analysis, and RT-PCR. Samples of obR 
gene-expressing tissue, may also be evaluated 
iramunocytochemically using antibodies specific for the 
obR transgene product. 
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C. . ANTIBODIES TO ObR PROTEIKS 

Antibodies that specifically recognize one or more 
epitopes of ObR, or epitopes of conserved variants of 
ObR, or peptide fragments of the ObR are also encompassed 
5 by the invention. Such antibodies include but are not 
limited to polyclonal antibodies, monoclonal antibodies 
(mAbs) , humanized or chimeric antibodies, single chain 
antibodies, Fab fragments, F(ab') 2 fragments, fragments 
produced by a Fab expression library, anti-idiotypic 
10 (anti-Id) antibodies, and epitope-binding fragments of 
any of the above. 

The antibodies of the invention may be used, for 
example, in the detection of the ObR in a biological 
sample and may, therefore, be utilized as part of a 
15 diagnostic or prognostic technique whereby patients may 
be tested for abnormal amounts of ObR. Such antibodies 
may also be utilized in conjunction with, for example, 
compound screening schemes, as described, below, in 
Section 5.5, for the evaluation of the effect of test 
20 compounds on expression and/or activity of the obR gene 
product. Additionally, such antibodies can be used in 
conjunction with the gene therapy techniques described, 
below, in Section 5.6, to, for example, evaluate the 
normal and/or engineered obR-expressing cells prior to 
25 their introduction into the patient. Such antibodies may 
additionally be used as a method for the inhibition of 
abnormal ObR activity. Thus, such antibodies may, 
therefore, be utilized as part of weight disorder 
treatment methods . 
30 F or the production of antibodies, various host 

animals may be immunized by injection with the ObR, an 
ObR peptide r e.q. ■ one corresponding the a functional 
domain of the receptor, such as BCD, TM or CD), truncated 
ObR polypeptides (ObR in which one or more domains, e^^, 
3 5 the TM or CD, has been deleted) , functional equivalents 
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of the ObR or mutants of the ObR. Such host animals may 
include but are not limited to rabbits, mice, and rats, 
to name but a few. Various adjuvants may be used to 
increase the immunological response, depending on the 
5 host species, including but not limited to Freund's 

(complete and incomplete) , mineral gels such as aluminum 
hydroxide, surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanin, dinitrophenol , and 
10 potentially useful human adjuvants such as BCG (bacille 
Calmette-Guerin) and rnrvnptocterium parvum. Polyclonal 
antibodies are heterogeneous populations of antibody 
molecules derived from the sera of the immunized animals. 
Monoclonal antibodies, which are homogeneous 
15 populations of antibodies to a particular antigen, may be 
obtained by any technique which provides for the 
production of antibody molecules by continuous cell lines 
in culture. These include, but are not limited to, the 
hybridoma technique of Kohler and Milstein, (1975, Nature 
20 256:495-497; and U.S. Patent No. 4,376,110), the human B- 
cell hybridoma technique (Kosbor et al. , 1983, Immunology 
Today 4:72; Cole et al., 1983, Proc. Natl. Acad. Sci. USA 
80:2026-2030), and the EBV-hybridoma technique (Cole et 
al., 1985, Monoclonal Antibodies And cancer Therapy, Alan 
25 R. Liss, inc., pp. 77-96). Such antibodies may be of any 
immunoglobulin class including IgG, IgM, IgE, IgA, IgD 
and any subclass thereof. The hybridoma producing the 
mAb of this invention may be cultivated in vitro or in 
vivo. Production of high titers of mAbs in vivo makes 
3 0 this the presently preferred method of production. 

In addition, techniques developed for the 
production of "chimeric antibodies" (Morrison et al. , 
1984, Proc. Natl. Acad. Sci., 81:6851-6855; Neuberger et 
al., 1984, Nature, 312:604-608; Takeda et al., 1985, 
35 Nature, 314:452-454) by splicing the genes from a mouse 
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antibody molecule of appropriate antigen specificity 
together with genes from a human antibody molecule of 
appropriate biological activity can be used. A chimeric 
antibody is a molecule in which different portions are 
5 derived from different animal species, such as those 

having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. 

Alternatively, techniques described for the 
production of single chain antibodies (U.S. Patent 

10 4,946,778; Bird, 1988, Science 242:423-426; Huston et 

al., 1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and 
Ward et al., 1989, Nature 334:544-546) can be adapted to 
produce single chain antibodies against obR gene 
products. Single chain antibodies are formed by linking 

15 the heavy and light chain fragments of the Fv region via 
an amino acid bridge, resulting in a single chain 
polypeptide * 

Antibody fragments which recognize specific 
epitopes may be generated by known techniques. For 

20 example, such fragments include but are not limited to: 
the F(ab') 2 fragments which can be produced by pepsin 
digestion of the antibody molecule and the Fab fragments 
which can be generated by reducing the disulfide bridges 
of the F(ab') 2 fragments. Alternatively, Fab expression 

25 libraries may be constructed (Huse et al., 1989, Science, 
246:1275-1281) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

Antibodies to the ObR can, in turn, be utilized to 
generate anti-idiotype antibodies that "mimic" the ObR, 

30 using techniques well known to those skilled in the art. 
(See, e.g. . Greenspan & Bona, 1993, FASEB J 7 ( 5) : 437-444 ; 
and Nissinoff , 1991, J. Immunol. 147 (8) : 2429-2438) . For 
example antibodies which bind to the ObR ECD and 
competitively inhibit the binding of Ob to the ObR can be 

35 used to generate anti-idiotypes that "mimic" the ECD and, 
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therefore, bind and neutralize Ob. Such neutralizing 
anti-idiotypes or Fab fragments of such anti-idiotypes 
can be used in therapeutic regimens to neutralize Ob and 
promote weight gain. 

5 D. . DIAGNOSIS OF BODY WEIGHT 

DISORDER ABNORMALITIES 

A variety of methods can be employed for the 
diagnostic and prognostic evaluation of body weight 
disorders, including obesity, cachexia and anorexia, and 
10 for the identification of subjects having a 
predisposition to such disorders. 

Such methods may, for example, utilize reagents 
such as the obR nucleotide sequences described in Section 
5.1, and ObR antibodies, as described, in Section 5.3. 
15 Specifically, such reagents may be used, for example, 
for: (1) the detection of the presence of obR gene 
mutations, or the detection of either over- or under- 
expression of obR mRNA relative to the non-body weight 
disorder state; (2) the detection of either an over- or 
2 0 an under-abundance of obR gene product relative to the 
non-body weight disorder state; and (3) the detection of 
perturbations or abnormalities in the signal transduction 
pathway mediated by ObR. 

The methods described herein may be performed, for 
2 5 example, by utilizing pre-packaged diagnostic kits 

comprising at least one specific obR nucleotide sequence 
or ObR antibody reagent described herein, which may be 
conveniently used, e.g. , in clinical settings, to 
diagnose patients exhibiting body weight disorder 
30 abnormalities. 

For the detection of obR mutations, any nucleated 
cell can be used as a starting source for genomic nucleic 
acid. For the detection of obR gene expression or obR 
gene products, any cell type or tissue in which the obE 
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gene is expressed, such as, for example, choroid plexus 
cells, may be utilized. 

Nucleic acid-based detection techniques are 
described, below, in Section 5.4.1* Peptide detection 
5 techniques are described, below, in Section 5.4.2. 

1 . . DETECTION OF THE obR GENE 

AND TRANSCRIPTS 



10 



Mutations within the obR gene can be detected by 
utilizing a number of techniques. Nucleic acid from any 
nucleated cell can be used as the starting point for such 
assay techniques, and may be isolated according to 
standard nucleic acid preparation procedures which are 
well known to those of skill in the art. 

DNA may be used in hybridization or amplification 
15 assays of biological samples to detect abnormalities 

involving obR gene structure, including point mutations, 
insertions, deletions and chromosomal rearrangements. 
Such assays may include, but are not limited to, Southern 
analyses, single stranded conformational polymorphism 
2 0 analyses (SSCP) , and PCR analyses. 

Such diagnostic methods for the detection of obR 
gene-specific mutations can involve for example, 
contacting and incubating nucleic acids including 
recombinant DNA molecules, cloned genes or degenerate 

2 5 variants thereof, obtained from a sample, ej^, derived 

from a patient sample or other appropriate cellular 
source, with one or more labeled nucleic acid reagents 
including recombinant DNA molecules, cloned genes or 
degenerate variants thereof, as described in Section 5.1, 

3 0 under conditions favorable for the specific annealing of 

these reagents to their complementary sequences within 
the obR gene. Preferably, the lengths of these nucleic 
acid reagents are at least 15 to 3 0 nucleotides. After 
incubation, all non-annealed nucleic acids are removed 
35 from the nucleic acidrobR molecule hybrid. The presence 
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of nucleic acids which have hybridized, if any such 
molecules exist, is then detected. Using such a 
detection scheme, the nucleic acid from the cell type or 
tissue of interest can be immobilized, for example, to a 
5 solid support such as a membrane, or a plastic surface 
such as that on a microtiter plate or polystyrene beads. 
In this case, after incubation, non-annealed, labeled 
nucleic acid reagents of the type described in Section 
5.1 are easily removed. Detection of the remaining, 

10 annealed, labeled obR nucleic acid reagents is 

accomplished using standard techniques well-known to 
those in the art. The obR gene sequences to which the 
nucleic acid reagents have annealed can be compared to 
the annealing pattern expected from a normal obR gene 

15 sequence in order to determine whether an obR gene 
mutation is present. 

Alternative diagnostic methods for the detection 
of obR gene specific nucleic acid molecules, in patient 
samples or other appropriate cell sources, may involve 

20 their amplification, e.g. , by PCR (the experimental 

embodiment set forth in Mullis, K.B. , 1987, U.S. Patent 
No. 4,683,202), followed by the detection of the 
amplified molecules using techniques well known to those 
of skill in the art. The resulting amplified sequences 

2 5 can be compared to those which would be expected if the 

nucleic acid being amplified contained only normal copies 
of the obR gene in order to determine whether an obR gene 
mutation exists. 

Additionally, well-known genotyping techniques can 

3 0 be performed to identify individuals carrying obR gene 

mutations. Such techniques include, for example, the use 
of restriction fragment length polymorphisms (RFLPs) , 
which involve sequence variations in one of the 
recognition sites for the specific restriction enzyme 
3 5 used. 
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Additionally, improved methods for analyzing DNA 
polymorphisms which can be utilized for the 
identification of obR gene mutations have been described 
which capitalize on the presence of variable numbers of 
short, tandemly repeated DNA sequences between the 
restriction enzyme sites. For example, Weber (U.S. Pat. 
No. 5,075,217, which is incorporated herein by reference 
in its entirety) describes a DNA marker based on length 
polymorphisms in blocks of (dC-dA) n- (dG-dT) n short tandem 
repeats. The average separation of (dC-dA) n- (dG-dT) n 
blocks is estimated to be 30,000-60,000 bp. Markers 
which are so closely spaced exhibit a high frequency co- 
inheritance, and are extremely useful in the 
identification of genetic mutations, such as, for 
example, mutations within the obR gene, and the diagnosis 
of diseases and disorders related to obR mutations. 

Also, Caskey et al. (U.S. Pat. No. 5,364,759, 
which is incorporated herein by reference in its 
entirety) describe a DNA profiling assay for detecting 
short tri and tetra nucleotide repeat sequences. The 
process includes extracting the DNA of interest, such as 
the obR gene, amplifying the extracted DNA, and labelling 
the repeat sequences to form a genotypic map of the 

individual's DNA. 
25 Th e level of obR gene expression can also be 

assayed by detecting and measuring obR transcription. 
For example, RNA from a cell type or tissue known, or 
suspected to express the ob£ gene, such as brain, 
especially choroid plexus cells, may be isolated and 
30 tested utilizing hybridization or PGR techniques such as 
are described, above. The isolated cells can be derived 
from cell culture or from a patient. The analysis of 
cells taken from culture may be a necessary step m the 
assessment of cells to be used as part of a cell-based 
35 gene therapy technique or, alternatively, to test the 
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effect of compounds on the expression of the obR gene. 
Such analyses may reveal both quantitative and 
qualitative aspects of the expression pattern of the pbR 
gene, including activation or inactivation of obR gene 

5 expression. 

In one embodiment of such a detection scheme, 
cDNAs are synthesized from the RNAs of interest (e^, by 
reverse transcription of the RNA molecule into cDNA) . A 
sequence within the cDNA is then used as the template for 
10 a nucleic acid amplification reaction, such as a PCR 
amplification reaction, or the like. The nucleic acid 
reagents used as synthesis initiation reagents (e^, 
primers) in the reverse transcription and nucleic acid 
amplification steps of this method are chosen from among 
15 the obj* nucleic acid reagents described in Section 5.1. 
The preferred lengths of such nucleic acid reagents are 
at least 9-30 nucleotides. For detection of the 
amplified product, the nucleic acid amplification may be 
performed using radioactively or non-radioactively 
20 labeled nucleotides. Alternatively, enough amplified 
product may be made such that the product may be 
visualized by standard ethidium bromide staining or by 
utilizing any other suitable nucleic acid staining 
method* 

Additionally, it is possible to perform such fibJR 
gene expression assays "in situ", i^_, directly upon 
tissue sections (fixed and/or frozen) of patient tissue 
obtained from biopsies or resections, such that no 
nucleic acid purification is necessary. Nucleic acid 
reagents such as those described in Section 5 . 1 may be 
used as probes and/or primers for such in situ procedures 
(See, for example, Nuovo, G.J., 1992, "PCR In Situ 
Hybridization: Protocols And Applications", Raven Press, 
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Alternatively, if a sufficient quantity of the 
appropriate cells can be obtained, standard Northern 
analysis can be performed to determine the level of mRNA 
expression of the obR gene. 

5 2 . • DETECTION OF THE ObR GENE PRODUCTS 

Antibodies directed against wild type or mutant 
obR gene products or conserved variants or peptide 
fragments thereof, which are discussed, above, in Section 
5.3, may also be used as body weight disorder diagnostics 
10 and prognostics,- as described herein. Such diagnostic 

methods, may be used to detect abnormalities in the level 
of obR gene expression, or abnormalities in the structure 
and/or temporal, tissue, cellular, or subcellular 
location of the ObR, and may be performed in vivo or in 
15 vitro , such as, for example, on biopsy tissue. 

For example, antibodies directed to epitopes of 
the ObR ECD can be used in vivo to detect the pattern and 
level of expression of the ObR in the body. Such 
antibodies can be labeled, e.g. , with a radio-opaque or 
2 0 other appropriate compound and injected into a subject in 
order to visualize binding to the ObR expressed in the 
body using methods such as X-rays, CAT-scans, or MRI . 
Labeled antibody fragments, e.g. . the Fab or single chain 
antibody comprising the smallest portion of the antigen 

2 5 binding region, are preferred for this purpose to promote 

crossing the blood-brain barrier and permit labeling ObRs 
expressed in the choroid plexus. 

Additionally, any ObR fusion protein or ObR 
conjugated protein whose presence can be detected, can be 

3 0 administered • For example, ObR fusion or conjugated 

proteins labeled with a radio-opaque or other appropriate 
compound can be administered and visualized in vivo, as 
discussed, above for labeled antibodies. Further such Ob 
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utilized for in vitro diagnostic procedures. 

Alternatively, immunoassays or fusion protein 
detection assays, as described above, can be utilized on 
5 biopsy and autopsy samples in vitro to permit assessment 
of the expression pattern of the ObR. Such assays are 
not confined to the use of antibodies that define the ObR 
ECD, but can include the use of antibodies directed to 
epitopes of any of the domains of the ObR, e.g. . the ECD, 
10 the TM and/or CD. The use of each or all of these 
labeled antibodies will yield useful information 
regarding translation and intracellular transport of the 
ObR to the cell surface, and can identify defects in 
processing. 

15 The tissue or cell type to be analyzed will 

generally include those which are known, or suspected, to 
express the obR gene, such as, for example, choroid 
plexus cells. The protein isolation methods employed 
herein may, for example, be such as those described in 

20 Harlow and Lane (Harlow, E. and Lane, D. , 1988, 

"Antibodies: A Laboratory Manual", Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York) , which is, 
incorporated herein by reference in its entirety. The 
isolated cells can be derived from cell culture or from a 

2 5 patient. The analysis of cells taken from culture may be 

a necessary step in the assessment of cells that could be 
used as part of a cell-based gene therapy technique or, 
alternatively, to test the effect of compounds on the 
expression of the obR gene. 

3 0 For example, antibodies, or fragments of 

antibodies, such as those described, above, in Section 
5.3, useful in the present invention may be used to 
quantitatively or qualitatively detect the presence of 
obR gene products or conserved variants or peptide 
3 5 fragments thereof. This can be accomplished, for 
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example, by immunofluorescence techniques employing a 
fluorescently labeled antibody (see below, this Section) 
coupled with light microscopic, flow cytometric, or 
fluorimetric detection. Such techniques are especially 
5 preferred if such obR gene products are expressed on the 
cell surface. 

The antibodies (or fragments thereof) or Ob fusion 
or conjugated proteins useful in the present invention 
may, additionally, be employed histologically, as in 
10 immunofluorescence, immunoelectron microscopy or non- 

immuno assays, for in situ detection of obR gene products 
or conserved variants or peptide fragments thereof, or 
for Ob binding (in the case of labeled Ob fusion 
protein) . 

15 In situ detection may be accomplished by removing 

a histological specimen from a patient, and applying 
thereto a labeled antibody or fusion protein of the 
present invention. The antibody (or fragment) or fusion 
protein is preferably applied by overlaying the labeled 

20 antibody (or fragment) onto a biological sample. Through 
the use of such a procedure, it is possible to determine 
not only the presence of the obR gene product, or 
conserved variants or peptide fragments, or Ob binding, 
but also its distribution in the examined tissue. Using 

25 the present invention, those of ordinary skill will 
readily perceive that any of a wide variety of 
histological methods (such as staining procedures) can be 
modified in order to achieve such in situ detection. 

Immunoassays and non-immunoassays for obR gene 

3 0 products or conserved variants or peptide fragments 

thereof will typically comprise incubating a sample, such 
as a biological fluid, a tissue extract, freshly 
harvested cells, or lysates of cells which have been 
incubated in cell culture, in the presence of a 

3 5 detectably labeled antibody capable of identifying obR 
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15 



gene products or conserved variants or peptide fragments 
thereof, and detecting the bound antibody by any of a 
number of techniques well-known in the art. 

The biological sample may be brought in contact 
with and immobilized onto a solid phase support or 
carrier such as nitrocellulose, or other solid support 
which is capable of immobilizing cells, cell particles or 
soluble proteins. The support may then be washed with 
suitable buffers followed by treatment with the 
detectably labeled ObR antibody or Ob fusion protein. 
The solid phase support may then be washed with the 
buffer a second time to remove unbound antibody or fusion 
protein. The amount of bound label on solid support may 
then be detected by conventional means. 

By "solid phase support or carrier" is intended 
any support capable of binding an antigen or an antibody. 
Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, 
amylases, natural and modified celluloses, 
2 0 polyacrylamides, gabbros, and magnetite. The nature of 
the carrier can be either soluble to some extent or 
insoluble for the purposes of the present invention. The 
support material may have virtually any possible 
structural configuration so long as the coupled molecule 
25 is capable of binding to an antigen or antibody. Thus, 
the support configuration may be spherical, as in a bead, 
or cylindrical, as in the inside surface of a test tube,' 
or the external surface of a rod. Alternatively, the 
surface may be flat such as a sheet, test strip, etc. 
30 Preferred supports include polystyrene beads. Those 

skilled in the art will know many other suitable carriers 
for binding antibody or antigen, or will be able to 
ascertain the same by use of routine experimentation. 

The binding activity of a given lot of ObR 
antibody or Ob fusion protein may be determined according 
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to well Jcnown methods. Those skilled in the art will be 
able to determine operative and optimal assay conditions 
for each determination by employing routine 
experimentation . 
5 With respect to antibodies, one of the ways in 

which the ObR antibody can be detectably labeled is by 
linking the same to an enzyme and use in an enzyme 
immunoassay (EIA) (Voller, A. , "The Enzyme Linked 
Immunosorbent Assay (ELISA)", 1978, Diagnostic Horizons 
10 2:1-7, Microbiological Associates Quarterly Publication, 
Walkersville, KD) ; Voller, A. et al., 1978, J. Clin. 
Pathol. 31:507-520; Butler, J.E., 1981, Meth. Enzymol. 
73:482-523; Maggio, E. (ed.), 1980, Enzyme Immunoassay, 
CRC Press, Boca Raton, FL, ; Ishikawa, E. et al., (eds.), 
15 1981, Enzyme Immunoassay, Kgaku Shoin, Tokyo). The 

enzyme which is bound to the antibody will react with an 
appropriate substrate, preferably a chromogenic 
substrate, in such a manner as to produce a chemical 
moiety which can be detected, for example, by 
20 spectrophotometric , fluorimetric or by visual means. 
Enzymes which can be used to detectably label the 
antibody include, but are not limited to, malate 
dehydrogenase, staphylococcal nuclease, delta-5-steroid 
isomerase, yeast alcohol dehydrogenase, alpha- 
25 glycerophosphate, dehydrogenase, triose phosphate 

isomerase, horseradish peroxidase, alkaline phosphatase, 
asparaginase, glucose oxidase, beta-galactosidase , 
r ibonuclease , urease , catalase , glucose-6-phosphate 
dehydrogenase, glucoamylase and acetylcholinesterase. 
3 0 The detection can be accomplished by color imetric methods 
which employ a chromogenic substrate for the enzyme. 
Detection may also be accomplished by visual comparison 
of the extent of enzymatic reaction of a substrate in 
comparison with similarly prepared standards. 
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Detection may also be accomplished using any of a 
variety of other immunoassays. For example, by 
radioactively labeling the antibodies or antibody 
fragments, it is possible to detect ObR through the use 
5 of a radioimmunoassay (RIA) (see, for example, Weintraub 
B., Principles of Radioimmunoassays, Seventh Training 
Course on Radioligand Assay Techniques, The Endocrine 
Society, March, 1986, which is incorporated by reference 
herein) . The radioactive isotope can be detected by such 
10 means as the use of a gamma counter or a scintillation 
counter or by autoradiography. 

It is also possible to label the antibody with a 
fluorescent compound. When the f luorescently labeled 
antibody is exposed to light of the proper wave length, 
15 its presence can then be detected due to fluorescence.' 
Among the most commonly used fluorescent labeling 
compounds are fluorescein isothiocyanate, rhodamine, 
phycoerythrin, phycocyanin, allophycocyanin, o- 
phthaldehyde and f luorescamine. 
20 The antibody can also be detectably labeled using 

fluorescence emitting metals such as 152 Eu, or others of 
the lanthanide series. These metals can be attached to 
the antibody using such metal chelating groups as 
diethylenetriaminepentacetic acid (DTPA) or 
5 ethylenediaminetetraacetic acid (EDTA) . 

The antibody also can be detectably labeled by 
coupling it to a chemi luminescent compound. The presence 
of the chemiluminescent-tagged antibody is then 
determined by detecting the presence of luminescence that 
0 arises during the course of a chemical reaction. 

Examples of particularly useful chemi luminescent labeling 
compounds are luminol, isoluminol, theromatic acridinium 
ester, imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to 
5 label the antibody of the present invention. 
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Bioluminescence is a type of chemi luminescence found in 
biological systems in, which a catalytic protein 
increases the efficiency of the chemi luminescent 
reaction. The presence of a bioluminescent protein is 
5 determined by detecting the presence of luminescence. 
Important bioluminescent compounds for purposes of 
labeling are luciferin, luciferase and aequorin. 

E • . SCREENING ASSAYS FOR COMPOUNDS THAT 

MODUIiATE ObR EXPRESSION OR ACTIVITY 

10 The following assays are designed to identify 

compounds that interact with ( e.g. . bind to) ObR 
(including, but not limited to the ECD or CD of ObR), 
compounds that interact with (e.g. . bind to) 
intracellular proteins that interact with ObR (including, 
15 but not limited to, the TM and CD of ObR), compounds that 
interfere with the interaction of ObR with transmembrane 
or intracellular proteins involved in ObR-mediated signal 
transduction, and to compounds which modulate the 
activity of obR gene ( i.e. . modulate the level of obR 
20 gene expression) or modulate the level of ObR. Assays 
may additionally be utilized which identify compounds 
which bind to obR gene regulatory sequences (e.g. , 
promoter sequences) and which may modulate obR gene 
expression. See e.g. , Piatt, K.A. , 1994, J. Biol. Chem. ■ 
25 269 :28558-28562, which is incorporated herein by 
reference in its entirety. 

The compounds which may be screened in accordance 
with the invention include, but are not limited to 
peptides, antibodies and fragments thereof, and other 
30 organic compounds ( e.g. , peptidomimetics) that bind to 
the ECD of the ObR and either mimic the activity 
triggered by the natural ligand ( i.e. , agonists) or 
inhibit the activity triggered by the natural ligand 
f i.e. , antagonists); as well as peptides, antibodies or 
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fragments thereof, and other organic compounds that mimic 
the ECD of the ObR (or a portion thereof) and bind to and 
"neutralize" natural ligand. 

Such compounds may include, but are not limited 
5 to, peptides such as, for example, soluble peptides, 
including but not limited to members of random peptide 
libraries; (see, e.g.. Lam, K.S. et al . , 1991, Nature 
254:82-84; Houghten, R. et al., 1991, Nature 354 : 84-86) f 
and combinatorial chemistry-derived molecular library 

10 made of D- and/or L- configuration amino acids, 

phosphopeptides (including, but not limited to, members 
of random or partially degenerate, directed 
phosphopeptide libraries; see, e.g. . Songyang, Z. et al. , 
1993, Cell 72:767-778), antibodies (including, but not 

15 limited to, polyclonal, monoclonal, humanized, anti- 

idiotypic, chimeric or single chain antibodies, and FAb, 
F(ab') 2 and FAb expression library fragments, and 
epitope-binding fragments thereof) , and small organic or 
inorganic molecules. 

20 Other compounds which can be screened in 

accordance with the invention include but are not limited 
to small organic molecules that are able to cross the 
blood-brain barrier, gain entry into an appropriate cell 
( e.g. . in the choroid plexus or in the hypothalamus) and 

25 affect the expression of the obR gene or some other gene 
involved in the ObR signal transduction pathway ( e.g. . by 
interacting with the regulatory region or transcription 
factors involved in gene expression) ; or such compounds 
that affect the activity of the ObR ( e.g. , by inhibiting 

3 0 or enhancing the enzymatic activity of the CD) or the 
activity of some other intracellular factor involved in 
the ObR signal transduction pathway, such as, for 
example, gpl30. 

Computer modelling and searching technologies 

35 permit identification of compounds, or the improvement of 
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already identified compounds, that can modulate ObR 
expression or activity. Having identified such a 
compound or composition, the active sites or regions are 
identified. Such active sites might typically be ligand 
5 binding sites, such as the interaction domains of Ob with 
ObR itself. The active site can be identified using 
methods known in the art including, for example, from the 
amino acid sequences of peptides, from the nucleotide 
sequences of nucleic acids, or from study of complexes of. 
10 the relevant compound or composition with its natural 
ligand. In the latter case, chemical or X-ray 
crystallographic methods can be used to find the active 
site by finding where on the factor the complexed ligand 
is found. 

15 Next, the three dimensional geometric structure of 

the active site is determined. This can be done by known 
methods, including X-ray crystallography, which can 
determine a complete molecular structure. On the other 
hand, solid or liquid phase NMR can be used to determine 

20 certain intra-molecular distances. Any other 

experimental method of structure determination can be 
used to obtain partial or complete geometric structures. 
The geometric structures may be measured with a complexed 
ligand, natural or artificial, which may increase the 

25 accuracy of the active site structure determined. 

If an incomplete or insufficiently accurate 
structure is determined, the methods of computer based 
numerical modelling can be used to complete the structure 
or improve its accuracy. Any recognized modelling method 

30 may be used, including parameterized models specific to 
particular biopolymers such as proteins or nucleic acids, 
molecular dynamics models based on computing molecular 
motions, statistical mechanics models based on thermal 
ensembles, or combined models. For most types of models, 
35 standard molecular force fields, representing the forces 
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between constituent atoms and groups, are necessary, and 
can be selected from force fields known in physical 
chemistry. The incomplete or less accurate experimental 
structures can serve as constraints on the complete and 
5 more accurate structures computed by these modeling 
methods . 

Finally, having determined the structure of 
the active site, either experimentally, by modeling, or 
by a combination, candidate modulating compounds can be 

10 identified by searching databases containing compounds 

along with information on their molecular structure. Such 
a search seeks compounds having structures that match the 
determined active site structure and that interact with 
the groups defining the active site. Such a seach can be 

15 manual, but is preferably computer assisted. These 
compounds found from this search are potential ObR 
modulating compounds. 

Alternatively, these methods can be used to 
identify improved modulating compounds from an already 

20 known modulating compound or ligand. The composition of 
the known compound can be modified and the structural 
effects of modification can be determined using the 
experimental and computer modelling methods described 
above applied to the new composition. The altered 

25 structure is then compared to the active site structure 
of the compound to determine if an improved fit or 
interaction results. In this manner systematic 
variations in composition, such as by varying side 
groups, can be quickly evaluated to obtain modified 

3 0 modulating compounds or ligands of improved specificity 
or activity. 

Further experimental and computer modeling methods 
useful to identify modulating compounds based upon 
identification of the active sites of Ob, ObR, and 
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related transduction and transcription factors will be 
apparent to those of skill in the art. 

Examples of molecular modelling systems are the 
CHARMm and QUANTA programs (Polygen Corporation, Waltham, 
5 MA) . CHARMm performs the energy minimization and 
molecular dynamics functions. QUANTA performs the 
construction, graphic modelling and analysis of molecular 
structure. QUANTA allows interactive construction, 
modification, visualization, and analysis of the behavior 

10 of molecules with each other. 

A number of articles review computer modelling of 
drugs interactive with specific proteins, such as 
Rotivinen, et al., 1988, Acta Pharmaceutical Fennica 
97:159-166; Ripka, New Scientist 54-57 (June 16, 1988); 

15 McKinaly and Rossmann, 1989, Annu . Rev , Pharmacol , 
Toxiciol. 29:111-122; Perry and Davies, OSAR: 
Quantitative Structure-Activity Relationships in Drug 
Design pp. 189-193 (Alan R. Liss, Inc. 1989); Lewis and 
Dean, 1989 Proc . R . Soc . Lond . 236:125-140 and 141-162; 

2 0 and, with respect to a model receptor for nucleic acid 

components, Askew, et al., 19 89. J. Am. Chem. Soc. 
111:1082-1090. Other computer programs that screen and 
graphically depict chemicals are available from companies 
such as BioDesign, Inc. (Pasadena, CA. ) , Allelix, Inc. 
25 (Mississauga, Ontario, Canada) , and Hypercube, Inc. 
(Cambridge, Ontario) . Although these are primarily 
designed for application to drugs specific to particular 
proteins, they can be adapted to design of drugs specific 
to regions of DNA or RNA, once that region is identified. 

3 0 Although described above with reference to design 

and generation of compounds which could alter binding, 
one could also screen libraries of known compounds, 
including natural products or synthetic chemicals, and 
biologically active materials, including proteins, for 
35 compounds which are inhibitors or activators. 
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Compounds identified via assays such as those 
described herein may be useful, for example, in 
elaborating the biological function of the ob^ gene 
product, and for ameliorating body weight disorders. 
Assays for testing the effectiveness of compounds, 
identified by, for example, techniques such as those 
described in Section 5.5.1 through 5.5.3, are discussed, 
below, in Section 5.5.4. 

1 . . IK VITRO SCREENING ASSAYS FOR 
COMPOUNDS THAT BIND TO Obf* 

In vitro systems may be designed to identify 
compounds capable of interacting with ( e.g. r binding to) 
ObR (including, but not limited to, the ECD or CD of 
ObR). Compounds identified may be useful, for example, 
in modulating the activity of wild type and/or mutant obR 
gene products; may be useful in elaborating the 
biological function of the ObR; may be utilized in 
screens for identifying compounds that disrupt normal ObR 
interactions; or may in themselves disrupt such 
20 interactions. 

The principle of the assays used to identify 
compounds that bind to the ObR involves preparing a 
reaction mixture of the ObR and the test compound under 
conditions and for a time sufficient to allow the two 
25 components to interact and bind, thus forming a complex 
which can be removed and/or detected in the reaction 
mixture. The ObR species used can vary depending upon 
the goal of the screening assay. For example, where 
agonists of the natural ligand are sought, the full 
30 length ObR, or a soluble truncated ObR, e.g. f in which 
the TM and/or CD is deleted from the molecule, a peptide 
corresponding to the ECD or a fusion protein containing 
the ObR ECD fused to a protein or polypeptide that 
affords advantages in the assay system ( e.g. . labeling, 
3 5 isolation of the resulting complex, etc.) can be 
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utilized. Where compounds that interact with the 
cytoplasmic domain are sought to be identified, peptides 
corresponding to the ObR CD and fusion proteins 
containing the ObR CD can be used. 
5 The screening assays can be conducted in a variety 

of ways. For example, one method to conduct such an 
assay would involve anchoring the ObR protein, 
polypeptide, peptide or fusion protein or the test 
substance onto a solid phase and detecting ObR/test 

10 compound complexes anchored on the solid phase at the end 
of the reaction. In one embodiment of such a method, the 
ObR reactant may be anchored onto a solid surface, and 
the test compound, which is not anchored, may be labeled, 
either directly or indirectly. 

15 In practice, microtiter plates may conveniently be 

utilized as the solid phase. The anchored component may 
be immobilized by non— covalent or covalent attachments. 
Non-covalent attachment may be accomplished by simply 
coating the solid surface with a solution of the protein 

20 and drying. Alternatively, an immobilized antibody, 
preferably a monoclonal antibody, specific for the 
protein to be immobilized may be used to anchor the 
protein to the solid surface. The surfaces may be 
prepared in advance and stored. 

25 In order to conduct the assay, the nonimmobilized 

component is added to the coated surface containing the 
anchored component. After the reaction is complete, 
unreacted components are removed ( e.g. . by washing) under 
conditions such that any complexes formed will remain 

3 0 immobilized on the solid surface. The detection of 
complexes anchored on the solid surface can be 
accomplished in a number of ways. Where the previously 
nonimmobilized component is pre-labeled r the detection of 
label immobilized on the surface indicates that complexes 

3 5 were formed. Where the previously nonimmobilized 
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component is not pre-labeled, an indirect label can be 
used to detect complexes anchored on the surface; e.g. . 
using a labeled antibody specific for the previously 
nonimmobilized component (the antibody, in turn, may be 
5 directly labeled or indirectly labeled with a labeled 
anti-Ig antibody) . 

Alternatively, a reaction can be conducted in a 
liquid phase, the reaction products separated from 
unreacted components, and complexes detected; e.g. P using 

10 an immobilized antibody specific for ObR protein, 
polypeptide, peptide or fusion protein or the test 
compound to anchor any complexes formed in solution, and 
a labeled antibody specific for the other component of 
the possible complex to detect anchored complexes. 

15 Alternatively, cell-based assays can be used to 

identify compounds that interact with ObR. To this end, 
cell lines that express ObR, or cell lines ( e.g. . COS 
cells, CHO cells, fibroblasts, etc.) that have been 
genetically engineered to express ObR ( e.g. . by 

20 transfection or transduction of ObR DNA) can be used. 

Interaction of the test compound with, for example, the 
ECD of obR expressed by the host cell can be determined 
by comparison or competition with native Ob. 

2 - . ASSAYS FOR INTRACELLULAR PROTEINS 
25 THAT INTERACT WITH THE ObR 

Any method suitable for detecting protein-protein 
interactions may be employed for identifying 
transmembrane proteins or intracellular proteins that 
interact with ObR. Among the traditional methods which 
3 0 may be employed are co-immunoprecipitation, crosslinlcing 
and co-purification through gradients or chromatographic 
columns of cell lysates or proteins obtained from cell 
ly sates and the ObR to identify proteins in the lysate 
that interact with the ObR. For these assays, the ObR 
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component used can be a full length ObR, a soluble 
derivative lacking the membrane-anchoring region ( e.g. , a 
truncated ObR in which the TM is deleted resulting in a 
truncated molecule containing the ECD fused to the CD) , a 
5 peptide corresponding to the CD or a fusion protein 
containing the CD of ObR. Once isolated, such an 
intracellular protein can be identified and can, in turn, 
be used, in conjunction with standard techniques, to 
identify proteins with which it interacts. For example,. 

10 at least a portion of the amino acid sequence of an 

intracellular protein which interacts with the ObR can be 
ascertained using techniques well known to those of skill 
in the art, such as via the Edman degradation technique. 
(See, e.g. . Creighton, 1983, "Proteins: Structures and 

15 Molecular Principles", W.H. Freeman & Co., N.Y., pp. 34- 
49) . The amino acid sequence obtained may be used as a 
guide for the generation of oligonucleotide mixtures that 
can be used to screen for gene sequences encoding such 
intracellular proteins. Screening may be accomplished, 

20 for example, by standard hybridization or PCR techniques. 
Techniques for the generation of oligonucleotide mixtures 
and the screening are well-known. (See, e.g. ; Ausubel, 
supra. . and PCR Protocols: A Guide to Methods and 
Applications, 1990, Innis, M . et al., eds. Academic 

25 Press, Inc., New York). 

Additionally, methods may be employed which result 
in the simultaneous identification of genes which encode 
the transmembrane or intracellular proteins interacting 
with ObR. These methods include, for example, probing 

30 expression, libraries, in a manner similar to the well 
known technique of antibody probing of Xgtll libraries, 
using labeled ObR protein, or an ObR polypeptide, peptide 
or fusion protein, e.g. . an ObR polypeptide or ObR domain 
fused to a marker ( e.g. . an enzyme, fluor, luminescent 

3 5 protein, or dye) , or an Ig-Fc domain. 
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One method which detects protein interactions in. 
vivo, the two-hybrid system, is described in detail for 
illustration only and not by way of limitation. One 
version of this system has been described (Chien et al., 
1991, Proc. Natl. Acad. Sci. USA, 88:9578-9582) and is 
commercially available from Clontech (Palo Alto, CA) . 

Briefly, utilizing such a system, plasmids are 
constructed that encode two hybrid proteins: one plasmid 
consists of nucleotides encoding the DNA-binding domain 
of a transcription activator protein fused to an QfeB 
nucleotide sequence encoding obR, an ObR polypeptide, 
peptide or fusion protein, and the other plasmid consists 
of nucleotides encoding the transcription activator 
protein's activation domain fused to a cDNA encoding an 
unknown protein which has been recombined into this 
plasmid as part of a cDNa' library . The DNA-binding 
domain fusion plasmid and the cDNA library are 
transformed into a strain of the yeast Saccharo^^ 
cerevisfrae that contains a reporter gene ( e.g. . HBS or 
iasZ) whose regulatory region contains the transcription 
activator's binding site. Either hybrid protein alone 
cannot activate transcription of the reporter gene: the 
DNA-binding domain hybrid cannot because it does not 
provide activation function and the activation domain 
hybrid cannot because it cannot localize to the 
activator's binding sites. Interaction of the two hybrid 
proteins reconstitutes the functional activator protein 
and results in expression of the reporter gene, which is 
detected by an assay for the reporter gene product. 

The two-hybrid system or related methodology may 
be used to screen activation domain libraries for 
proteins that interact with the "bait" gene product. By 
way of example, and not by way of limitation, ObR may be 
used as the bait gene product. Total genomic or cDNA 
35 sequences are fused to the DNA encoding an activation 
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domain. This library and a plasmid encoding a hybrid of 
a bait QhR gene product fused to the DNA-binding domain 
are cotransf ormed into a yeast reporter strain, and the 
resulting transf ormants are screened for those that 
5 express the reporter gene. For example, and not by way 
of limitation, a bait obR gene sequence, such as the open 
reading frame of obR (or a domain of obR) , as depicted in 
FIG. 1, FIG. 3 or FIG, 6 can be cloned into a vector such 
that it is translationally fused to the DNA encoding the 
10 DNA-binding domain of the GAL 4 protein. These colonies 
are purified and the library plasmids responsible for 
reporter gene expression are isolated. DNA sequencing is 
then used to identify the proteins encoded by the library 
plasmids. 

15 A cDNA library of the cell line from which 

proteins that interact with bait obR gene product are to 
be detected can be made using methods routinely practiced 
in the art. According to the particular system described 
herein, for example, the cDNA fragments can be inserted 

2 0 into a vector such that they are translationally fused to 
the transcriptional activation domain of GAL 4 . This 
library can be co-transformed along with the bait obR 
gene-GAL4 fusion plasmid into a yeast strain which 
contains a lacZ gene driven by a promoter which contains 

25 GAL 4 activation sequence. A cDNA encoded protein, fused 
to GAL 4 transcriptional activation domain, that interacts 
with bait obR gene product will reconstitute an active 
GAL4 protein and thereby drive expression of the HIS3 
gene. Colonies which express HIS3 can be detected by 

30 their growth on petri dishes containing semi-solid agar 
based media lacking histidine. The cDNA can then be 
purified from these strains, and used to produce and 
isolate the bait obR gene- interacting protein using 
techniques routinely practiced in the art. 
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3 . • ASSAYS FOR COMPOUNDS THAT INTERFERE 
WITH ObR/ INTRACELLULAR OR ObR/ 
TRANSMEMBRANE MACROMOLEOTTT. ig 
INTERACTION * f ~ 

5 The macromolecules that interact with the ObR are 

referred to, for purposes of this discussion, as "binding 

partners". These binding partners are likely to be 

involved in the ObR signal transduction pathway, and 

therefore, in the role of ObR in body weight regulation, 

10 Therefore, it is desirable to identify compounds that 

interfere with or disrupt the interaction of such binding 
partners with Ob which may be useful in regulating the 
activity of the ObR and control body weight disorders 
associated with ObR activity. 

15 The basic principle of the assay systems used to 

identify compounds that interfere with the interaction 
between the ObR and its binding partner or partners 
involves preparing a reaction mixture containing ObR 
protein, polypeptide, peptide or fusion protein as 

2 0 described in Sections 5,5.1 and 5.5-2 above, and the 

binding partner under conditions and for a time 
sufficient to allow the two to interact and bind, thus 
forming a complex. In order to test a compound for 
inhibitory activity, the reaction mixture is prepared in 
25 the presence and absence of the test compound. The test 
compound may be initially included in the reaction 
mixture, or may be added at a time subsequent to the 
addition of the ObR moiety and its binding partner. 
Control reaction mixtures. are incubated without the test 

3 0 compound or with a placebo. The formation of any 

complexes between the ObR moiety and the binding partner 
is then detected. The formation of a complex in the 
control reaction, but not in the reaction mixture 
containing the test compound, indicates that the compound 
3 5 interferes with the interaction of the ObR and the 
interactive binding partner. Additionally, complex 
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formation within reaction mixtures containing the test 
compound and normal ObR protein may also be compared to 
complex formation within reaction mixtures containing the 
test compound and a mutant ObR. This comparison may be 
5 important in those cases wherein it is desirable to 
identify compounds that disrupt interactions of mutant 
but not normal ObRs. 

The assay for compounds that interfere with the 
interaction of the ObR and binding partners can be 

10 conducted in a heterogeneous or homogeneous format . 
Heterogeneous assays involve anchoring either the ObR 
moiety product or the binding partner onto a solid phase 
and detecting complexes anchored on the solid phase at 
the end of the reaction. In homogeneous assays, the 

15 entire reaction is carried out in a liquid phase. In 
either approach, the order of addition of reactants can 
be varied to obtain different information about the 
compounds being tested. For example, test compounds that 
interfere with the interaction by competition can be 

20 identified by conducting the reaction in the presence of 
the test substance; i.e. . by adding the test substance to 
the reaction mixture prior to or simultaneously with the 
ObR moiety and interactive binding partner. 
Alternatively, test compounds that disrupt preformed 

25 complexes, e.g. compounds with higher binding constants 
that displace one of the components from the complex, can 
be tested by adding the test compound to the reaction 
mixture after complexes have been formed. The various 
formats are described briefly below. 

30 In a heterogeneous assay system, either the ObR 

moiety or the interactive binding partner, is anchored 
onto a solid surface, while the non-anchored species is 
labeled, either directly or indirectly. In practice, 
microtiter plates are conveniently utilized. The 

3 5 anchored species may be immobilized by non-covalent or 
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covalent attachments. Non-covalent attachment may be 
accomplished simply by coating the solid surface with a 
solution of the obR gene product or binding partner and 
drying. Alternatively, an immobilized antibody specific 
5 for the species to be anchored may be used to anchor the 
species to the solid surface. The surfaces may be 
prepared in advance and stored. 

In order to conduct the assay, the partner of the 
immobilized species is exposed to the coated surface with 

10 or without the test compound. After the reaction is 
complete, unreacted components are removed ( e.g. . by 
washing) and any complexes formed will remain immobilized 
on the solid surface. The detection of complexes 
anchored on the solid surface can be accomplished in a 

15 number of ways. Where the non-immobilized species is 
pre-labeled, the detection of label immobilized on the 
surface indicates that complexes were formed. Where the 
non-immobilized species is not pre-labeled, an indirect 
label can be used to detect complexes anchored on the 

2 0 surface; e.g. , using a labeled antibody specific for the 

initially non- immobilized species (the antibody, in turn, 
may be directly labeled or indirectly labeled with a 
labeled anti-lg antibody) . Depending upon the order of 
addition of reaction components, test compounds which 
25 inhibit complex formation or which disrupt preformed 
complexes can be detected. 

Alternatively, the reaction can be conducted in a 
liquid phase in the presence or absence of the test 
compound, the reaction products separated from unreacted 

3 0 components, and complexes detected; e.g. , using an 

immobilized antibody specific for one of the binding 
components to anchor any complexes formed in solution, 
and a labeled antibody specific for the other partner to 
detect anchored complexes. Again, depending upon the 
35 order of addition of reactants to the liquid phase, test 
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compounds which inhibit complex or which disrupt 
preformed complexes can be identified. 

In an alternate embodiment of the invention, a 
homogeneous assay can be used. In this approach, a 
5 preformed complex of the ObR moiety and the interactive 
binding partner is prepared in which either the ObR or 
its binding partners is labeled, but the signal generated 
by the label is quenched due to formation of the complex 
(see, e.g. , U.S. Patent No. 4,109,496 by Rubenstein which 

10 utilizes this approach for immunoassays) . The addition 
of a test substance that competes with and displaces one 
of the species from the preformed complex will result in 
the generation of a signal above background. in this 
way, test substances which disrupt ObR/ intracellular 

15 binding partner interaction can be identified. 

In a particular embodiment, an ObR fusion can be 
prepared for immobilization. For example, the ObR or a 
peptide fragment, e.g. , corresponding to the CD, can be 
fused to a glutathione-S-transf erase (GST) gene using a 

2 0 fusion vector, such as pGEX-5X-l, in such a manner that 
its binding activity is maintained in the resulting 
fusion protein. The interactive binding partner can be 
purified and used to raise a monoclonal antibody, using 
methods routinely practiced in the art and described 

2 5 above, in Section 5.3. This antibody can be labeled with. 

the radioactive isotope 125 I, for example, by methods 
routinely practiced in the art. In a heterogeneous 
assay, e.g. . the GST— ObR fusion protein can be anchored 
to glutathione-agarose beads. The interactive binding 

3 0 partner can then be added in the presence or absence of 

the test compound in a manner that allows interaction and 
binding to occur. At the end of the reaction period, 
unbound material can be washed away, and the labeled 
monoclonal antibody can be added to the system and 
3 5 allowed to bind to the complexed components. The 
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interaction between the obR gene product and the 
interactive binding partner can be detected by measuring 
the amount of radioactivity that remains associated with 
the glutathione-agarose beads. A successful inhibition 
5 of the interaction by the test compound will result in a 
decrease in measured radioactivity. 

Alternatively, the GST-ObR fusion protein and the 
interactive binding partner can be mixed together in 
liquid in the absence of the solid glutathione-agarose 
10 beads. The test compound can be added either during or 
after the species are allowed to interact. This mixture 
can then be added to the glutathione-agarose beads and 
unbound material is washed away. Again the extent of 
inhibition of the ObR/binding partner interaction can be 
15 detected by adding the labeled antibody and measuring the 
radioactivity associated with the beads. 

In another embodiment of the invention, these same 
techniques can be employed using peptide fragments that 
correspond to the binding domains of the ObR and/ or the 
2 0 interactive or binding partner (in cases where the 

binding partner is a protein) , in place of one or both of 
the full length proteins. Any number of methods 
routinely practiced in the art can be used to identify 
and isolate the binding sites. These methods include, 
25 but are not limited to, mutagenesis of the gene encoding 
one of the proteins and screening for disruption of 
binding in a co-immunoprecipitation assay. Compensating 
mutations in the gene encoding the second species in the 
complex can then be selected. Sequence analysis of the 
0 genes encoding the respective proteins will reveal the 
mutations that correspond to the region of the protein 
involved in interactive binding. Alternatively, one 
protein can be anchored to a solid surface using methods 
described above, and allowed to interact with and bind to 
5 its labeled binding partner, which has been treated with 
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a proteolytic enzyme, such as trypsin. After washing, a 
short, labeled peptide comprising the binding domain may 
remain associated with the solid material, which can be 
isolated and identified by amino acid sequencing. Also, 
5 once the gene coding for the intracellular binding 
partner is obtained, short gene segments can be 
engineered to express peptide fragments of the protein, 
which can then be tested for binding activity and 
purified or synthesized. 

10 For example, and not by way of limitation, an obR 

gene product can be anchored to a solid material as 
described, above, by making a GST-ObR fusion protein and 
allowing it to bind to glutathione agarose beads. The 
interactive binding partner can be labeled with a 

15 radioactive isotope, such as 35 S, and cleaved with a 

proteolytic enzyme such as trypsin. Cleavage products 
can then be added to the anchored GST- obR fusion protein 
and allowed to bind. After washing away unbound 
peptides, labeled bound material, representing the 

2 0 intracellular binding partner binding domain, can be 

eluted, purified, and analyzed for amino acid sequence by 
well-known methods. Peptides so identified can be 
produced synthetically or fused to appropriate 
facilitative proteins using recombinant DNA technology. 

2 5 4.. ASSAYS FOR IDENTIFICATION OF 

COMPOUNDS THAT AMELIORATE 
BODY WEIGHT DISORDERS 



Compounds, including but not limited to binding 
compounds identified via assay techniques such as those 

3 0 described, above, in Sections 5.5.1 through 5.5.3, can be 
tested for the ability to ameliorate body weight disorder 
symptoms, including obesity. The assays described above 
can identify compounds which affect ObR activity ( e.g. , 
compounds that bind to the ObR, inhibit binding of the 

3 5 natural ligand, and either activate signal transduction 
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(agonists) or block activation (antagonists) , and 
compounds that bind to the natural ligand of the obR and 
neutralize ligand activity) ; or compounds that affect obR 
gene activity (by affecting obR gene expression, 
5 including molecules, e.g. , proteins or small organic 

molecules, that affect or interfere with splicing events 
so that expression of the full length or the truncated 
form of the ObR can be modulated) . However, it should be 
noted that the assays described can also identify 
10 compounds that modulate ObR signal transduction r e. a. . 

compounds which affect downstream signalling events, such 
as inhibitors or enhancers of tyrosine kinase or 
phosphatase activities which participate in transducing 
the signal activated by Ob binding to the ObR) . The 
15 identification and use of such compounds which affect 
another step in the ObR signal transduction pathway in 
which the obR gene and/or ob£ gene product is involved 
and, by affecting this same pathway may modulate the 
effect of ObR on the development of body weight disorders 
0 are within the scope of the invention. Such compounds 
can be used as part of a therapeutic method for the 
treatment of body weight disorders. 

The invention encompasses cell-based and animal 
model-based assays for the identification of compounds 
exhibiting such an ability to ameliorate body weight 
disorder symptoms. Such cell-based assay systems can 
also be used as the "gold standard" to assay for purity 
and potency of the natural ligand, Ob, including 
recombinant ly or synthetically produced Ob and Ob 
mutants . 

Cell-based systems can be used to identify 
compounds which may act to ameliorate body weight 
disorder symptoms. Such cell systems can include, for 
example, recombinant or non-recombinant cells, such as 
cell lines, which express the obR gene. For example 
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choroid plexus cells, hypothalamus cells, or cell lines 
derived from choroid plexus or hypothalamus can be used. 
In addition, expression host cells fe.g. . cos cells, CHO 
cells, fibroblasts) genetically engineered to express a 
5 functional ObR and to respond to activation by the 
natural Ob ligand, e.g. , as measured by a chemical or 
phenotypic change, induction of another host cell gene, 
change in ion flux (e^., Ca ++ ) , tyrosine phosphorylation 
of host cell proteins, etc., can be used as an end point 

10 in the assay. 

In utilizing such cell systems, cells may be 
exposed to a compound suspected of exhibiting an ability 
to ameliorate body weight disorder symptoms, at a 
sufficient concentration and for a time sufficient to 

15 elicit such an amelioration of body weight disorder 

symptoms in the exposed cells. After exposure, the cells 
can be assayed to measure alterations in the expression 
of the obR gene, e.g. , by assaying cell lysates for obR 
mRNA transcripts (e.g. . by Northern analysis) or for obR 

20 protein expressed in the cell; compounds which regulate 
or modulate expression of the obR gene are good 
candidates as therapeutics. Alternatively, the cells are 
examined to determine whether one or more body weight 
disorder-like cellular phenotypes has been altered to 

25 resemble a more normal or more wild type, non-body weight 
disorder phenotype, or a phenotype more likely to produce 
a lower incidence or severity of disorder symptoms. 
Still further, the expression and/or activity of 
components of the signal transduction pathway of which 

3 0 ObR is a part, or the activity of the ObR signal 
transduction pathway itself can be assayed. 

For example, after exposure, the cell lysates can 
be assayed for the presence of tyrosine phosphorylation 
of host cell proteins, as compared to lysates derived 

35 from unexposed control cells. The ability of a test 
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proteins in these assay systems indicates that the test 
compound inhibits signal transduction initiated by ObR 
activation. The cell lysates can be readily assayed 
5 using a Western blot format; i.e. , the host cell proteins 
are resolved by gel electrophoresis, transferred and 
probed using a anti-phosphotyrosine detection antibody 
( Q *q- * an anti-phosphotyrosine antibody labeled with a 
signal generating compound, such as radiolabel, fluor, 
10 enzyme, etc.) (See, e.g. . Glenney et al., 1988, J. 

Immunol. Methods iQ9:277-285; Frackelton et al., 1983, 
Mol. Cell. Biol. 3:1343-1352). Alternatively, an ELISA 
format could be used in which a particular host cell 
protein involved in the ObR signal transduction pathway 
15 is immobilized using an anchoring antibody specific for 
the target host cell protein, and the presence or absence 
of phosphotyrosine on the immobilized host cell protein 
is detected using a labeled anti-phosphotyrosine 
antibody. (See, King et al., 1993, Life Sciences 
20 53:1465-1472). In yet another approach, ion flux, such 
as calcium ion flux, can be measured as an end point for 
ObR stimulated signal transduction. 

Alternatively, activation of STAT proteins, and 
stimulation of transcription mediated through IL-6 
25 responsive gene elements may be measured to test the 
ability of a compound to regulate ObR mediated signal 
transduction. For example, a recombinant expression 
vector may be engineered to contain the IL6 responsive 
element sequences cloned adjacent to a reporter gene and 
3 0 regulation of ObR activity may be measured by assaying 
for reporter gene activity. Reporter genes that may be 
used include, but are not limited to those encoding 
chloramphenicol acetyl transferase (CAT) , firefly 
lucif erase or human growth hormone. 
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In addition, animal-based body weight 
disorder systems, which may include, for example, ob, db 
and ob/djj mice, may be used to identify compounds capable 
of ameliorating body weight disorder-like symptoms. Such 
5 animal models may be used as test substrates for the 
identification of drugs, pharmaceuticals, therapies and 
interventions which may be effective in treating such 
disorders. For example, animal models may be exposed to 
a compound, suspected of exhibiting an ability to 
10 ameliorate body weight disorder symptoms, at a sufficient 
concentration and for a time sufficient to elicit such an 
amelioration of body weight disorder symptoms in the 
exposed animals. The response of the animals to the 
exposure may be monitored by assessing the reversal of 
15 disorders associated with body weight disorders such as 
obesity. with regard to intervention, any treatments 

which reverse any aspect of body weight disorder-like 
symptoms should be considered as candidates for human 
body weight disorder therapeutic intervention. Dosages 
2 0 of test agents may be determined by deriving dose- 
response curves, as discussed in Section 5.7.1, below. 

F.. THE TREATMENT OP BODY WEIGHT, 

INCLUDING BODY WEIGHT DISORDERS 

The invention encompasses methods and compositions 
25 for modifying body weight and treating body weight 

disorders, including but not limited to obesity, cachexia 
and anorexia. Because a loss of normal obR gene product 
function results in the development of an obese 
phenotype, an increase in obR gene product activity, or 
30 activation of the ObR pathway ( e.g. . downstream 

activation) would facilitate progress towards a normal 
body weight state in obese individuals exhibiting a 
deficient level of obR gene expression and/or obR 
activity. 
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Alternatively, symptoms of certain body weight 
disorders such as, for example, cachexia, which involve 
lower than normal body weight phenotype, may be 
ameliorated by decreasing the level of obR gene 
expression, and/or obR gene activity, and/or 
downregulating activity of the ObR pathway ( e.g. . by 
targeting downstream signalling events) . Different 
approaches are discussed below. 



10 



1 - . INHIBITION OF ObR EXPRESSION OR 

ObR ACTIVITY TO PROMOTE W EIGHT QATK 

Any method which neutralizes Ob or inhibits 
expression of the obR gene (either transcription or 
translation) can be used to effectuate weight gain. Such 
approaches can be used to treat body weight disorders 
15 such as anorexia or cachexia. Such methods can also be 
useful for agricultural applications; i.e. . to increase 
the weight of livestock animals. 

For example, the administration of soluble 
peptides, proteins, fusion proteins, or antibodies 
2 0 (including anti-idiotypic antibodies) that bind to and 
"neutralize" circulating Ob, the natural ligand for the 
ObR, can be used to effectuate weight gain. To this end, 
peptides corresponding to the ECU of ObR, soluble 
deletion mutants of ObR f e.q. . ATMObR mutants) , or either 
25 of these ObR domains or mutants fused to another 

polypeptide (e.g. , an IgFc polypeptide) can be utilized. 
Alternatively, anti-idiotypic antibodies or Fab fragments 
of antiidiotypic antibodies that mimic the ObR ECD and 
neutralize Ob can be used (see Section 5.3, supra ) . Such 
30 ObR peptides, proteins, fusion proteins, anti-idiotypic 
antibodies or Fabs are administered to a subject in 
amounts sufficient to neutralize Ob and to effectuate 
weight gain. 
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ObR peptides corresponding to the ECD having the 
amino acid sequence shown in FIGs. 1 or 6 f from about 
amino acid residue number 23 to about 837, or having the 
amino acid sequence shown in FIG . 3 from about amino acid 
5 residue number 21 to about 839 can be used. ObR ATM 
mutants in which all or part of the 2 3 amino acid 
hydrophobic anchor sequence ( e.g. . about amino acid 
residue number 838 to 860 in FIGs . 1 or 6, or about 840 
to about 862 in FIG. 3) could also be used. Fusion of 
10 the ObR, the ObR ECD or the ATMObR to an IgFc polypeptide 
should not only increase the stability of the 
preparation, but will increase the half-life and activity 
of the ObR-Ig fusion protein in vivo . The Fc region of 
the Ig portion of the fusion protein may be further 
15 modified to reduce immunoglobulin effector function. See 
Section 10, infra . 

In a specific embodiment described herein the 
extracellular domains of the mouse or human ObR were 
fused to the IgG constant region. As indicated in FIG. 
20 10, purified ObR-IgG was able to potently inhibit, or 
neutralize, the binding of the AP-OB fusion protein to 
cell surface ObR . (See Section 10.4.) 

In an alternative embodiment for neutralizing 
circulating Ob, cells that are genetically engineered to 
25 express such soluble or secreted forms of ObR may be 

administered to a patient, whereupon they will serve as 
"bioreactors" in vivo to provide a continuous supply of 
the Ob neutralizing protein. Such cells may be obtained 
from the patient or an MHC compatible donor and can 
30 include, but are not limited to fibroblasts, blood cells 
(e^_9U./ lymphocytes) , adipocytes, muscle cells, 
endothelial cells etc. The cells are genetically 
engineered in vitro using recombinant DNA techniques to 
introduce the coding sequence for the ObR ECD, ATMObR, or 
35 for ObR-Ig fusion protein ( e.g. . ObR-, ECD- or ATMObR- 
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IgFc fusion proteins) into the cells, e.g. . by 
transduction (using viral vectors, and preferably vectors 
that integrate the transgene into the cell genome) or 
transfection procedures, including but not limited to the 
5 use of plasmids, cosmids, YACs, electroporation, 

liposomes, etc. The obE coding sequence can be placed 
under the control of a strong constitutive or inducible 
promoter or promoter/ enhancer to achieve expression and 
secretion of the ObR peptide or fusion protein. The 
10 engineered cells which express and secrete the desired 
ObR product can be introduced into the patient 
systemically , e.g. . in the circulation, 

intraperitoneal^, at the choroid plexus or hypothalamus. 
Alternatively, the cells can be incorporated into a 

15 matrix and implanted in the body, e.g. r genetically 

engineered fibroblasts can be implanted as part of a skin 
graft; genetically engineered endothelial cells can be 
implanted as part of a vascular graft. (See, for 
example, Anderson et al. U.S. Patent No. 5,399,349; and 

20 Mulligan & Wilson, U.S. Patent No. 5,460,959 each of 
which is incorporated by reference herein in its 
entirety) . 

When the cells to be administered are non- 
autologous cells, they can be administered using well 

2 5 known techniques which prevent the development of a host 

immune response against the introduced cells. For 
example, the cells may be introduced in an encapsulated 
form which, while allowing for an exchange of components 
with the immediate extracellular environment, does not 

3 0 allow the introduced cells to be recognized by the host 

immune system. 

In an alternate embodiment, weight gain therapy 
can be designed to reduce the level of endogenous obR 
gene expression, e.g. . using antisense or ribozyme 
3 5 approaches to inhibit or prevent translation of obR mRNA 
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transcription of the obR gene; or targeted homologous 
recombination to inactivate or "knock out" the obR gene 
or its endogenous promoter. Because the obR gene is 
5 expressed in the brain, including the choroid plexus and 
hypothalamus, delivery techniques should be preferably 
designed to cross the blood-brain barrier (see PCT 
WO89/10134, which is incorporated by reference herein in 
its entirety) . Alternatively, the antisense, ribozyroe or 

10 DNA constructs described herein could be administered 
directly to the site containing the target cells; e.g. . 
the choroid plexus and/or hypothalamus . 

Antisense approaches involve the design of 
oligonucleotides (either DNA or RNA) that are 

15 complementary to ObR mRNA. The antisense 

oligonucleotides will bind to the complementary obR mRNA 
transcripts and prevent translation. Absolute 
complementarity, although preferred, is not required. A 
sequence "complementary" to a portion of an RNA, as 

2 0 referred to herein, means a sequence having sufficient 

complementarity to be able to hybridize with the RNA, 
forming a stable duplex; in the case of double-stranded 
antisense nucleic acids, a single strand of the duplex 
DNA may thus be tested, or triplex formation may be 
25 assayed. The ability to hybridize will depend on both 
the degree of complementarity and the length of the 
antisense nucleic acid. Generally, the longer the 
hybridizing nucleic acid, the more base mismatches with 
an RNA it may contain and still form a stable duplex (or 

3 0 triplex, as the case may be) . One skilled in the art can 

ascertain a tolerable degree of mismatch by use of 
standard procedures to determine the melting point of the 
hy br id i z ed complex . 

Oligonucleotides that are complementary to the 5' 
3 5 end of the message, e.g. # the 5' untranslated sequence up 
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to and including the AUG initiation codon, should work 
most efficiently at inhibiting translation. However, 
sequences complementary to the 3' untranslated sequences 
of mRNAs have recently shown to be effective at 
5 inhibiting translation of mRNAs as well. See generally, 
Wagner, R. , 1994, Nature 372:333-335. Thus, 
oligonucleotides complementary to either the 5'- or 3'- 
non- translated, non-coding regions of the obR shown in 
FIG. 1 (murine short form), FIG.. 6 (murine long form) or 

10 FIG. 3 (human long form) could be used in an antisense 
approach to inhibit translation of endogenous obR mRNA. 
Oligonucleotides complementary to the 5' untranslated 
region of the mRNA should include the complement of the 
AUG start codon. Antisense oligonucleotides 

15 complementary to mRNA coding regions are less efficient 
inhibitors of translation but could be used in accordance 
with the invention. Whether designed to hybridize to the 
5'-, 3'- or coding region of ObR mRNA, antisense nucleic 
acids should be at least six nucleotides in length, and 

20 are preferably oligonucleotides ranging from 6 to about 
50 nucleotides in length. In specific aspects the 
oligonucleotide is at least 10 nucleotides, at least 17 
nucleotides, at least 25 nucleotides or at least 50 
nucleotides. 

25 Regardless of the choice of target sequence, it is 

preferred that In vitro studies are first performed to 
quantitate the ability of the antisense oligonucleotide 
to inhibit gene expression. It is preferred that these 
studies utilize controls that distinguish between 

3 0 antisense gene inhibition and nonspecific biological 
effects of oligonucleotides. It is also preferred that 
these studies compare levels of the target RNA or protein 
with that of an internal control RNA or protein. 
Additionally, it is envisioned that results obtained 

3 5 using the antisense oligonucleotide are compared with 
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those obtained using a control oligonucleotide. it is 
preferred that the control oligonucleotide is of 
approximately the same length as the test oligonucleotide 
and that the nucleotide sequence of the oligonucleotide 
5 differs from the antisense sequence no more than is 

necessary to prevent specific hybridization to the target 
sequence. 

The oligonucleotides can be DNA or RNA or chimeric 
mixtures or derivatives or modified versions thereof, 

10 single-stranded or double-stranded. The oligonucleotide 
can be modified at the base moiety, sugar moiety, or 
phosphate backbone, for example, to improve stability of 
the molecule, hybridization, etc. The oligonucleotide 
may include other appended groups such as peptides ( e.g. . 

15 for targeting host cell receptors in vivo ) , or agents 
facilitating transport across the cell membrane (see, 
e.g., Letsinger et al. , 1989, Proc. Natl. Acad. Sci. 
U.S.A. 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. 
Acad. Sci. 84:648-652; PCT Publication No. WO88/ 09810, 

20 published December 15, 1988) or the blood-brain barrier 
(see, e.g., PCT Publication No. WO89/10134, published 
April 25, 1988), hybridization-triggered cleavage agents. 
(See, e.g., Krol et al., 1988, BioTechniques 6:958-976) 
or intercalating agents. (See, e.g., Zon, 1988, Pharm. 

25 Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport 
agent, hybridization-triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at 

3 0 least one modified base moiety which is selected from the 
group including but not limited to 5-f luorouracil , 
5-bromouracil , 5-chlorouracil , 5-iodouracil , 
hypoxanthine, xantine, 4-acetylcytosine, 
5- (carboxyhydroxylmethyl) uracil, 

35 5-carboxymethylaminomethyl-2-thiouridine , 
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5-carboxymethylaminomethyluracil, dihydrouracil beta-D 
galactosylqueosine, inosine, "6-isopentenyladenlne 

1- methylguanine, 1-methylinosine, 2 , 2-dimethylguanlne 
5 '" lne ^ y ' adenine ' 2-n.ethylguanine, 3-methylcytosi„e, ' 
5 5-methylcytosine, N6-adenine, 7-methylguanine 

5-methylaminomethyluracil , 5-methoxyaminomethyl- 

2- thiouracil, beta-D-mannosylqueosine, 

5 • -methoxycarboxymethyluracil , 5-methoxyuracil 
2- me thyl t hio- N 6-i SO pent:enyladenine, uracil-5-oxyacetic 
10 acid (v), wybutoxosine, pseudouracil , queosine 

2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil 
4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl- 

15 ': t \ i ° UraCi1 ' 3 -( 3 -^no-3-K- 2 -carboxypro P yl) uracil, 
15 (ac P 3)w, and 2 , 6-diaminopurine. 

The antisense oligonucleotide may also comprise at 
least one modified sugar moiety selected from the group 
including but not limited to arabinose, 
2-fluoroarabinose, xylulose, and hexose. 
20 m yet another embodiment, the antisense 

oligonucleotide comprises at least one modified phosphate 
backbone selected from the group consisting of a 
phosphorothioate, a phosphorodithioate, a 
Phosphoramidothioate, a phosphoramidate , a 
25 phosphordiamidate, a methylphosphonate, an alkyl 

Phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the antisense 
oligonucleotide is an a-anomeric oligonucleotide. An «- 

30 rr 6 ^ C ° lig0nUCleotide specific double-stranded 

30 hybrids wxth complementary rna in which , con ^ 

usual 0-units, the strands run parallel to each other 
(Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641) 
The oligonucleotide is a 2'-o-methylribonucleotide (inoue 
et al., 1987, Nucl. Acids Res. 15:6131-6148), or a 
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chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett 
215:327-330) . 

Oligonucleotides of the invention may be 
synthesized by standard methods known in the art, e.g. by 
5 use of an automated DNA synthesizer (such as are' 
commercially available from Biosearch, Applied 
Biosystems, etc.). As examples, phosphorothioate 
oligonucleotides may be synthesized by the method of 
Stein et al. (1988, Nucl. Acids Res. 16:3209), 

10 methylphosphonate oligonucleotides can be prepared by use 
of controlled pore glass polymer supports (Sarin et al., 
1988, Proc. Natl. Acad. Sci . U.S.A. 85:7448-7451), etc/ 
While antisense nucleotides complementary to the 
obR coding region sequence could be used, those 

15 complementary to the transcribed untranslated region are 
most preferred. For example, antisense oligonucleotides 
having the following sequences can be utilized in 
accordance with the invention: 

a) 5 ' -CATCTTACTTCAGAGAA-3 ' which is complementary to 
2 0 nucleotides -14 to +3 in FIG. 3. 

b) 5 ' -CATCTTACTTCAGAGAAGTACAC-3 ' which is 
complementary to nucleotides -20 to +3 in FIG. 3. 



C) 5 ' -CATCTTACTTCAGAGAAGTACACCCATAA-3 ' which is 

complementary to nucleotides -2 6 to +3 in FIG. 3. 

25 d > 5 ' — CATCTTACTTCAGAGAAGTACACCCATAATCCTCT-3 ' which is 

complementary to nucleotides -3 2 to +3 in FIG. 3. 



e) 5 ' — AATCATCTTACTTCAGAGAAGTACACCCATAATCC-3 which is 

complementary to nucleotides -29 to +6 in FIG. 3. 
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f) 5/ — CTTACTTCAGAGAAGTACACCCATAATCC—3 which is 

complementary to nucleotides -29 to -i i n FIG. 3, 

9) 5 ' — TCAGAGAAGTACACCCATAATCC-3 which is 

complementary to nucleotides -29 to -7 in FIG. 3. 



5 h) 5'-AAGTACACCCATAATCC-3 which is complementary to 

nucleotides -29 to -13 in FIG. 3. 



cells 



The antisense molecules should be delivered to 



which express the ObR In vivo, e.g. . the choroid plexus 
10 and/or hypothalamus. A number of methods have been 

developed for delivering antisense DNA or RNA to cells; 

antisense molecules can be injected directly into 

the tissue site, or modified antisense molecules, 

designed to target the desired cells ( e.g. , antisense 
15 linked to peptides or antibodies that specifically bind 

receptors or antigens expressed on the target cell 

surface) can be administered systemically . 

However, it is often difficult to achieve 

intracellular concentrations of the antisense sufficient 
20 to suppress translation of endogenous mRNAs . Therefore 

preferred approach utilizes a recombinant DNA construct 

in which the 

antisense oligonucleotide is placed under the control of 
a strong pol III or pol II promoter. The use of such a 

25 construct to transfect target cells in the patient will 
result in the transcription of sufficient amounts of 
single stranded RNAs that will form complementary base 
pairs with the endogenous obR transcripts and thereby 
prevent translation of the obR mRNA. For example, a 

30 vector can be introduced in vivo such that it is taken up 



a 
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by a cell and directs the transcription of an antisense 
RNA. Such a vector can remain episomal or become 
chromosomally integrated, as long as it can be 
transcribed to produce the desired antisense RNA. Such 
5 vectors can be constructed by recombinant DMA technology 
methods standard in the art. Vectors can be plasmid, 
viral, or others known in the art, used for replication 
and expression in mammalian cells. Expression of the 
sequence encoding the antisense RNA can be by any 

10 promoter known in the art to act in mammalian, preferably 
human cells. Such promoters can be inducible or 
constitutive. Such promoters include but are not limited 
to: the SV40 early promoter region (Bernoist and Chambon, 
1981, Nature 290:304-310), the promoter contained in the 

15 3 ' long terminal repeat of Rous sarcoma virus (Yamamoto 
et al., 1980, Cell 22:787-797), the herpes thymidine 
kinase promoter (Wagner et al., 1981, Proc. Natl. Acad. 
Sci. U.S.A. 78:1441-1445), the regulatory sequences of 
the metallothionein gene (Brinster et al., 1982, Nature 

20 296:39-42), etc. Any type of plasmid, cosmid, YAC or 
viral vector can be used to prepare the recombinant DNA 
construct which can be introduced directly into the 
tissue site; e.g. , the choroid plexus or hypothalamus. 
Alternatively, viral vectors can be used which 

25 selectively infect the desired tissue; ( e.g. . for brain, 
herpesvirus vectors may be used) , in which case 
administration may be accomplished by another route 
( e.g. . systemically) . 

Ribozyme molecules designed to catalytically 

3 0 cleave obR mRNA transcripts can also be used to prevent 
translation of obR mRNA and expression of ObR. (See, 
e.g. . PCT International Publication WO90/11364, published 
October 4, 1990; Sarver et al., 1990, Science 247:1222- 
1225) . While ribozymes that cleave mRNA at site specific 

35 recognition sequences can be used to destroy obR mRNAs, 
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the use of hammerhead ribozymes is preferred „»» k 
ribozymes cleave mKNAs at locations'dictaLd by ™T 
regxons that tor. complementary base pairs with the 
target mRNA. The sole requirement is that the tarl t 
5 mRNA have the blowing segU ence of two bases ^ 3 , 

Is e werrT UCti ° n ^ Pr ° dUCti0n ° f rhead rihoe^s * 

W611 kn ° Wn ln the art ^d is described more fully^n 
Haseloff and Gerlach, 1988, Nature, 334:585-591. There 
are hundreds of potential hammerhead ribozyme cleavaae 

10 ;™ r nucieotide — - - « 

the I Preferab1 ^ the ri boZyine is engineered so that 

the cleavage recognition site is located near ^ 5 , 
of the pbB mRNA; i^, to increase eff ici ^ 

XS ^L^.^r 0 ' 11 - 1 " — ^ - non-functional 
For example, hammerhead ribozymes having the 
followxng sequences can be utilized in accordance with 
the invention: " 

a) 5 ' ~ ACAGAAUUUUUGACAAAUCAAAG CAG ANNNNUCUGAGNAGUCCUUAC 

UUCAGAGAA-3' which will cleave human obR mRNA 
between nucleotides -1 and 1 i n FIG. 3 

5 • -^CCCGGGCAGCCUGCCCAAAGCCGGNNNNCCGGAGNAGUCGCCAGA 
CCGGCUCGUG-3' which will cleave between 
nucleotides -175 and -176 in FIG. 3 

5--UGGCAUGCAAGACAAAGCAGGNNNNCCUGAGNAGUCCUUAAAUCUCC 
AAGGAGUAA -3' which will cleave between 

nucleotides 102 and 103 in FIG. 3. 

5 ' ~ UAUAUGAC AAAG ^UGUNNNNACAG AGNAGUCCUUGUGUGGUAAAGAC 
ACG-3' Which will cImvo v^«- 



b) 



25 c) 



30 



ACG-3 whxch wxll cleave between nucleotides 994 
and 995 in FIG. 3. 

5 ' -^CACCAAUUGAAUUGAUGGCCAAAGCGGGNNNNCCCGAGNAGUCAA 
CCGUAACAGUAUGU-3' which will cleave between 
nucleotides 2142 and 2143 in FIG. 3. 
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f) 



5 



5 ' -UGAAAUUGUUUCAGGCUCCAAAGCCGGNNNNCCGGAGNAGUCAAGAA 
GAGGACCACAUGUCACUGAUGC-3' which will cleave 
between nucleotides 2 73 6 and 27 3 7 in FIG . 3. 

9 ) 5 ' -GGUUUCUUCAGUGAAAUUACACAAAGCAGCNNNNGCUGAGNAGUCAG 

UUAGGUCACACAUC-3' which will cleave between 
nucleotides 3492 and 3493 in FIG. 3. 

h ) 5 ' -ACCCAUUAUAACACAAAGCUGANNNNUCAGAGNAGUCAUCUGAAGGU 

UUCUUC-3' which will cleave between nucleotides 
3521 and 3522 in FIG. 3. 



10 



The ribozymes of the present invention also 
include RNA endoribonucleases (hereinafter "Cech-type 
ribozymes") such as the one which occurs naturally in 
Tetrahymena Thermophila (known as the IVS , or L-19 ivs 
RNA) and which has been extensively described by Thomas 
15 Cech and collaborators (Zaug, et al., 1984, Science, 

224:574-578; Zaug and Cech, 1986, Science, 231:470-475; 
Zaug, et al., 1986, Nature, 324:429-433; published 
International patent application No. wo 88/04 3 00 by 
University Patents Inc.; Been and Cech, 1986, Cell, 
20 47:207-216). The Cech-type ribozymes have an eight base 
pair active site which hybridizes to a target RNA 
sequence whereafter cleavage of the target RNA takes 
place. The invention encompasses those Cech-type 
ribozymes which target eight base-pair active site 
25 sequences that are present in obR . 

As in the antisense approach, the ribozymes can be 
composed of modified oligonucleotides ( e.g. for improved 
stability, targeting, etc.) and should be delivered to 
cells which express the ObR In vivo, e.g. . hypothalamus 
3 0 and/ or the choroid plexus. A preferred method of 

delivery involves using a DNA construct "encoding" the 
ribozyme under the control of a strong constitutive pol 
III or pol II promoter, so that transfected cells will 
produce sufficient quantities of the ribozyme to destroy 
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endogenous obR mess age S and inhibit translation « 
rhymes unlike a„ tisense ffloleculcs , ^^J^"- 

e«L™ CellUlar — - i- ~guired for 

* Endogenous obR. gene expression can also be reduce 

>Y inactivating or knocking out" the pbR gene 
propter using targeted homologous reco^L^ ^ q 
see Smithies et al. , 1985 , Nature 317:230-234 t^JI^' 
capecchi. 198 7, cell » ll50 ,- Bla; Thompson ^al 
■0 Cell 5:313-321; each of which is incorporated by' 
reference herein in its entirety) . For example^ a 
mutant, non-functional ObR (or a completely unrelated DNA 

_JS gene) can be used, with or without a selectable 
marker and/or a negative selectable marker J™*** 1 * 
cells that express ObR i n vivo T L 1 ' ° transfec * 
^ ±n VIVO< Insertion of the DNA 

=t1; a ^ 

■ -ocularly suited in ^ ^^^^ 
modifications to ES (embryonic stem) cells can 7 

generate animal offsprina „i, h * USSd to 

P lng Wlth an inactive ObR r e a « M 
Thomas & capecchi i 987 and Thompson i 989 , ^J"" 
However this approach can be adapted for hun 
provided the recombinant DNA constructs are direct ^ 
administered or targeted to the required "te^ ^L 
using appropriate viral vectors *, n k ^ S 
vectors f Dr h-! • vectors, e^, herpes virus 

vectors for delivery to brain tissue; e^ the 
hypothalamus and/or choroid plexus. 

Alternatively, endogenous obR gene 
be reduced by taraeti™ * w expression can 

targeting deoxyribonucleotide sequences 

complementary to the M m,i,f y nces 

, i6 J 6 re 9«latory region of the obR g ene 

(i^U, the ofeB Promoter and/or enhancers) to f~ 
helical structures that prevent transcription oi thT^ 
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C. 1991 # Anticancer Drug Des., 6(6):569-84; Helene, c, 
et al., 1992, Ann, N.Y. Accad. Sex. f 660:27-36; and 
Maher, L.J., 1992, Bioassays 14 (12) : 807-15) . 

In yet another embodiment of the invention, the 
5 activity of obR can be reduced using a "dominant 

negative" approach to effectuate weight gain. To this 
end, constructs which encode defective ObRs can be used 
in gene therapy approaches to diminish the activity of 
the ObR in appropriate target cells. For example, 

10 nucleotide sequences that direct host cell expression of 
ObRs in which the CD ( e.g. . FIG. 1 amino acid residue 
number 861-894; FIG. 6 amino acid residue number 861- 
1162; or FIG. 3, amino acid residue number 863-1165) or a 
portion of the CD ( e.g. . the box 1 Jak interaction 

15 sequence; FIGs. 1 and 6, amino acid residue 861-884; or 
FIG. 3, amino acid residue 863-886) is deleted or mutated 
can be introduced into cells in the choroid plexus or 
hypothalamus (either by in vivo or ex vivo gene therapy 
methods described above) . Alternatively, targeted 

2 0 homologous recombination can be utilized to introduce 

such deletions or mutations into the subject's endogenous 
obR gene in the hypothalamus or choroid plexus. The 
engineered cells will express non- functional receptors 
(i -e. , an anchored receptor that is capable of binding 
25 its natural ligand, but incapable of signal 

transduction) . Such engineered cells present in the 
choroid plexus or hypothalamus should demonstrate a 
diminished response to the endogenous Ob ligand, 
resulting in weight gain. 

3 0 2.. RESTORATION OR INCREASE IN 

ObR EXPRESSION OR ACTIVITY 
TO PROMOTE WEIGHT LOSS 

With respect to an increase in the level of normal 

obR gene expression and/or ObR gene product activity, obR 

3 5 nucleic acid sequences can be utilized for the treatment 
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15 



!• • EFFECTIVE DQflB 
Toxicity and therapeutic efficacy of such 
compounds can be determined by standard pharmaceutical 
procedures in cell cultures or experimental animals, 
e^, for determining the LD 50 (the dose lethal to 50% of 
the population) and the ED 50 (the dose therapeutically 
effective in 50% of the population) . The dose ratio 
between toxic and therapeutic effects is the therapeutic 
index and it can be expressed as the ratio LD 50 /ED 50 . 
Compounds which exhibit large therapeutic indices are 
preferred. While compounds that exhibit toxic side 
effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site 
of affected tissue in order to minimize potential damage 
to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and 
animal studies can be used in formulating a range of 
dosage for use in humans. The dosage of such compounds 
lies preferably within a range of circulating 
concentrations that include the ED 50 with little or no 
toxicity. The dosage may vary within this range 
depending upon the dosage form employed and the route of 
administration utilized. For any compound used in the 
method of the invention, the therapeutically effective 
dose can be estimated initially from cell culture assays. 
A dose may be formulated in animal models to achieve a 
circulating plasma concentration range that includes the 
Ic 50 ( ** e - / the concentration of the test compound which 
achieves a half -maximal inhibition of symptoms) as 
30 determined in cell culture. Such information can be used 
to more accurately determine useful doses in humans. 
Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 



20 



25 
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2 . . FORMULATIONS AND USE 

Pharmaceutical compositions for use in accordance 
with the present invention may be formulated in 
conventional manner using one or more physiologically 
5 acceptable carriers or excipients. 

Thus, the compounds and their physiologically 
acceptable salts and solvates may be formulated for 
administration by inhalation or insufflation (either 
through the mouth or the nose) or oral, buccal, 
10 parenteral or rectal administration. 

For oral administration, the pharmaceutical 
compositions may take the form of, for example, tablets 
or capsules prepared by conventional means with 
pharmaceutically acceptable excipients such as binding 
15 agents ( e.g. . pregelatinised maize starch, 

polyvinylpyrrolidone or hydroxypropyl roethy Ice llu lose) ; 
fillers f e.g. . lactose, microcrystalline cellulose or 
calcium hydrogen phosphate) ; lubricants (e^, magnesium 
stearate, talc or silica) ; disintegrants (e^, potato 
20 starch or sodium starch glycolate) ; or wetting agents 
( e.g. . sodium lauryl sulphate) . The tablets may be 
coated by methods well known in the art. Liquid 
preparations for oral administration may take the form 
of, for example, solutions, syrups or suspensions, or 

2 5 they may be presented as a dry product for constitution 

with water or other suitable vehicle before use. Such 
liquid preparations may be prepared by conventional means 
with pharmaceutically acceptable additives such as 
suspending agents ( e.g. , sorbitol syrup, cellulose 

3 0 derivatives or hydrogenated edible fats) ; emulsifying 

agents ( e.g. . lecithin or acacia) ; non-aqueous vehicles 
( e.g. . almond oil, oily esters, ethyl alcohol, or 
fractionated vegetable oils) ; and preservatives (e^cu., 
methyl or propyl-p-hydroxybenzoates or sorbic acid) . The 
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preparations may also contain buffer salts, flavoring, 
coloring and sweetening agents as appropriate. 

Preparations for oral administration may be 
suitably formulated to give controlled release of the 
5 active compound - 

For buccal administration the compositions may 
take the form of tablets or lozenges formulated in 
conventional manner* 

For administration by inhalation, the compounds 
10 for use according to the present invention are 

conveniently delivered in the form of an aerosol spray 
presentation from pressurized packs or a nebulizer, with 
the use of a suitable propellant, e.g.,, 
dichlorodif luoromethane , tr ichlorof luoromethane , 
15 dichlorotetraf luoroethane, carbon dioxide or other 

suitable gas. In the case of a pressurized aerosol the 
dosage unit may be determined by providing a valve to 
deliver a metered amount. Capsules and cartridges of 
e.g. gelatin for use in an inhaler or insufflator may be 
20 formulated containing a powder mix of the compound and a 
suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral 
administration by injection, e.g. , by bolus injection or 
continuous infusion. Formulations for injection may be 
25 presented in unit dosage form, e.g. , in ampoules or in 
multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, 
solutions or emulsions in oily or aqueous vehicles, and 
may contain formulatory agents such as suspending, 
3 0 stabilizing and/ or dispersing agents. Alternatively, the 
active ingredient may be in powder form for constitution 
with a suitable vehicle, e.g. . sterile pyrogen-free 

water, before use. 

The compounds may also be formulated in rectal 
3 5 compositions such as suppositories or retention enemas, 
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e.g. , containing conventional suppository bases such as 
cocoa butter or other glycerides. 

In addition to the formulations described 
previously, the compounds may also be formulated as a 
5 depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneous ly 
or intramuscularly) or by intramuscular injection. Thus, 
for example, the compounds may be formulated with 
suitable polymeric or hydrophobic materials (for example 
10 as an emulsion in an acceptable oil) or ion exchange 

resins, or as sparingly soluble derivatives, for example,, 
as a sparingly soluble salt. 

The compositions may, if desired, be presented in 
a pack or dispenser device which may contain one or more 
15 unit dosage forms containing the active ingredient. The 
pack may for example comprise metal or plastic foil, such 
as a blister pack. The pack or dispenser device may be 
accompanied by instructions for administration. 

vi.. example; in situ localization OF ObR m 

20 the Example presented herein, it is demonstrated via 
binding studies with Ob ( leptin) -alkaline phosphatase 
(AP) fusion proteins that high affinity Ob receptor is 
present in mammalian choroid plexus tissue. It is 
further demonstrated that the fusion protein binding 

2 5 observed was Ob-specific, and not due to a non-specific 
alkaline phosphatase-based artifact. 

A. - MATERIALS AND METHODS 
construction and Expression of Ob-Alkaline 
Phosphatase Fusion Proteins . Two types of fusion protein 
30 were generated. Specifically, Ob-AP fusion proteins were 
generated in which the AP portion was at the carboxyl 
terminus of the fusion protein, and AP-Ob fusion proteins 
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were generated in which the AP portion was at the amino 
terminus of the fusion protein. 

To produce mouse and human Ob-AP and AP-ob fusion 
constructs, cDNA sequences were amplified by standard 
5 polymerase chain reaction procedures- For mouse and 
human Ob-AP fusions, nucleotide sequences encoding the 
entire open reading frames of mouse and human Ob, 
respectively were amplified from the corresponding cDNAs. 
Restriction sites at the end of the amplification primers 

10 were cut with Hindi 1 1 and BamHI (mouse) and inserted into 
the Hindlll-Bglll polylinker site of APtag-2 , or BamHI 
and Bglll (human) and inserted into the Bglll site of 
APtag-2. For mouse and human AP-Ob fusion constructs, a 
new AP fusion vector expressing an AP molecule with its 

15 own signal peptide was first generated (APtag-3) by 

replacing sequences between the Hindlll and Xhol sites of 
APtag-2 with PCR amplified sequences of secreted 
placental alkaline phosphatase (including signal 
sequence) . A Bglll site was placed so that fusions 

2 0 introduced into this site would be in-frame with the AP 
protein. The sequences of the predicted mature forms of 
mouse and human Ob were then PCR amplified from the 
corresponding cDNAs . Restriction sites at the end of the 
amplification primers were cut with BamHI and Bglll and 

25 inserted into the Bglll site of APtag-3 . 

Each plasmid was transiently transfected into COS- 
7 cells (11.25 jxg/150 mm plate). Cells were grown to 
confluence and then media-conditioned for 3 days. Cells 
were then centrifuged, 0.45Mm filtered, and stored at 4°C 

30 with 20 mM Hepes (pH 7.0) and 0.05% sodium azide. 

Conditioned media were tested and quantitated for AP 
activity in a 96-well plate reader as described by 
Flanagan and Leder (Flanagan, J.G. and Leder, P., 1990, 
Cell £3:185-194), except that homoarginine was omitted 

35 from all assays. 
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7 n nltu Prote ^ Binding. Quartered mouse 

brains, isolated choroid plexus, cells and cell lines 
were rinsed once with HBHA (Hank's balanced salt solutxon 
with 0.5mg/ml BSA, 0.1% NaN 3 , 20mM HEPES [pH 7.0]) xn 
5 12-well plates. Tissue was then incubated with tissue 
culture supernatants containing AP-Ob fusion, Ob-AP 
fusion, or control supernatants (i^, supernatants 
containing unfused AP only, containing AP-OB or OB-AP 
fusion proteins plus 80-fold molar excess of E. colx - 
10 derived recombinant OB, or supernatants from mock- 
transf ected COS cells) , for 75 minutes with gentle 
rotation at room temperature. Samples were then treated 
as described previously (Cheng, H.J. and Flanagan, J.G., 
1994, Cell 79: 157-168) . 

B.. RESULTS 
To search for the Ob receptor, Ob-alkaline 
phosphatase fusion proteins were constructed which would 
allow color imetric detection of Ob bxndxng. 

Specifically, cDNA molecules encoding the mouse and human 
20 Ob proteins were inserted into the expression vectors 

APtag-2 and APtag-3 , as described, above in Section 6.1. 
msertion into the expression vector APtag-2 resulted xn 
a fusion protein with Ob at the N-terminus of the fusxon 
protein and placental alkaline phosphatase (AP) at the 
C -terminus. The resulting fusion protein is referred to 
as Ob-AP. insertion into the vector APtag-3 resulted xn 
fusion proteins with AP at the N-terminus fused to the 
predicted mature form of the Ob protein at the 
C -terminus. The resulting fusion protein is referred to 
as AP-Ob. Both forms of murine fusion proteins were 
secreted and both were produced at the predicted 
molecular weight of approximately 81 kDa. 

several strategies were employed in an effort to 
identify cells or tissues expressing the 0b receptor. 



25 



30 
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Each of the cells, cell lines and tissues tested as 
described herein were at least potentially involved in 
body weight regulation. The first strategy employed was 
to attempt direct binding assays with the Ob-AP and AP-Ob 
5 fusion proteins. Cell lines examined by this strategy 
included the placental cell lines Be Wo (ATCC No. CCL98) 
and JAR (ATCC No. HTB144); the muscle cell lines L6 (ATCC 
No. CRL1458) and BC3H (ATCC No. CRL1443); the neural cell 
lines PC12 (ATCC No. CRL1721) and NB41A3 (ATCC No. 

10 CCL147) ; the preadipose cell line 3T3-L1 (ATCC No. 
CRIil73); and the liver cell line Hepal-6 (ATCC No. 
CRL183 0) . Also tested by this method were primary 
cultures from hypothalamus and primary cultures from 
cerebellum. None of these studies yielded positive 

15 binding results. 

Second, attempts were made to identify cell lines 
expressing Ob receptor by examining changes in gene 
expression in response to the presence of recombinant Ob 
protein. The rationale here was that changes in gene 

2 0 expression, whether obR gene expression or the expression 

of genes further downstream in the Ob/ObR-related signal 
transduction pathway, would identify cells in which ObR 
was present. 

This analysis was done by standard differential 
25 display analysis (see Pardee et al., U.S. Patent No. 

5,262,311) of RNA derived from Ob-treated or untreated 
cells. Briefly, RNA was isolated from cells which either 
had or had not been exposed to Ob, and was amplified via 
RT-PCR in a manner which allowed a direct quantitative 
30 comparison of the levels of individual transcripts 

present in the RNA derived from the Ob-treated relative 
to the Ob-untreated cell lines. Ob Cell lines tested by 
this approach were INS-1, 3T3-L1, Hepal-6, L6 , PC12 , 
NB41A3 and BC3H. In addition, primary hypothalamic 

3 5 cultures were also tested- None of the cells tested 
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exhibited a detectable quantitative difference in 
expression pattern based on whether the cells had or had 
not been treated with Ob. 

Third, attempts to identify cells expressing Ob 
5 receptor were made by treating cells with recombinant Ob 
protein and assaying for signs of signal transduction 
pathway activation- Specifically, cAMP changes were 
monitored via 3 H uptake, and tyrosine phosphorylation 
changes were assayed via Western blots treated with anti- 
10 phosphotyrosine antibodies. Over twenty cell lines were 
examined in this manner. Specifically, these cell lines 
included the mouse cell lines Yl (adrenal cortex; ATCC 
No. CCL79) , BC3H (smooth muscle-brain tumor; ATCC No. 
CRL1443) , P19 (embryonal carcinoma; ATCC No. CRL1825) , 
15 3T3L1 (preadipocyte; ATCC No. CRL173) , Hepal-6 (hepatoma; 
ATCC No. CRL1830), C2C12 (myoblast; ATCC No. CRL1772), 
NMUMG (mammary gland, normal epithelial; ATCC No. 
CRL1636), MM5MT (mammary gland; ATCC No. CRL1637), NB41A3 
(neuroblastoma; ATCC No. CCL147) , AtT20 (pituitary; ATCC 
20 No. CCL89), N MU LI (liver; ATCC No. CRL 1638), BNL CL2 
(liver; ATCC No. TIB73) , and NCTC-1469 (liver; ATCC No. 
CCL91); rat cell lines, including L6 (myoblast; ATCC No. 
CRL1458) , PC12 (adrenal chromaffin; ATCC No. CRL1721) , 
and H-4-II-E (hepatoma; ATCC No. CRL1548) ; and human cell 
25 lines, including SW872 (liposarcoma; ATCC No. HTB92) , 

Hepa G2 (liver; ATCC No. HB8065) , and neuroblastoma cell 
lines, including SK-N-SH (ATCC No. HTBll) - Here again, 
no Ob-dependent differences were observed in any of the 
cells tested. 

30 After an extensive search of mammalian cell lines 

and tissues, adult mouse brains were guartered, treated 
with AP-Ob fusion protein, washed, and tested for bound 
AP activity of the fusion protein using histological 
techniques, as described, above, in Section 6.1. 

35 Reproducible binding of the AP-Ob fusion protein was 
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observed in the rodent brain choroid plexus (within the 
lateral and third brain ventricals) . No AP-Ob staining 
was observed, however, in the brain tissues surrounding 
the choroid plexus. The choroid plexus is a tissue 
5 largely responsible for the generation of the cerebral 
spinal fluid- Further, choroid plexus tissue is 
considered to be one of the "guardians 11 of the blood- 
brain barrier. 

Control AP staining was performed on tissues 
10 treated with unfused AP and on tissues which had been 
treated with AP-Ob in the presence of an excess of 
unfused Ob added to compete for the binding of the fusion 
protein. Staining similar to that observed for the Ab-Ob 
fusion protein was not observed in either of these 
15 controls, demonstrating that the AP-Ob binding observed 
was Ob-specific, and not due to an AP-based artifact. 

In summary, therefore, only after employing 
several strategies, was a cell surface molecule which 
binds Ob located; and this cell surface molecule was 
20 found within a specific region of the brain, the choroid 
plexus . 

- * EXAMPLE t CLONING OF THE MURINE ObR GENE 

Described, below, in Section 7.2.1, is the 
successful cloning of a short form Ob receptor cDNA, 

25 famj5312, from expression libraries constructed using 

murine choroid plexus RNA. The expression libraries were 
screened using AP-Ob fusion protein binding, as 
described, above, in the Example presented in Section 6. 
Section 7.2.2, below, describes the nucleotide sequence 

3 0 of the short form Ob receptor coding region and, further, 
describes the amino acid sequence of the Ob short form 
receptor protein. Section 7.2.3, below, describes 
competitive binding studies demonstrating that the 
protein encoded by the isolated cDNA encodes a receptor 
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exhibiting high affinity binding for both mouse and human 
Ob protein. Section 7.2.4 describes studies which verify 
the authenticity of the isolated obR cDNA clone. 

The high affinity Ob binding exhibited by the ObR, 
5 coupled with its homology to the Class I family of 

cytokine receptors, as described, below, indicates that 
the ObR is involved in the control of mammalian body 
weight, via signal transduction triggered by its binding 
to Ob ligand. 

10 A.. MATERIALS AND METHODS 

Choroid Plexus mRNA Isolation . Total RNA was 
isolated from 300 mouse choroid plexuses in batches of 
100, using the guanidinium isothiocyanate/CsCl method of 
Chirgwin et al. (1979, Biochemistry 18:5294) as described 

15 by R. Selden in Current Protocols for Molecular Biology 
(4.2.3 Supplement 14). After quantitation, the RNA was 
diluted to lmg/ml in distilled, deionized water and 
incubated for 30 min at 37°c with an equal volume of 
DNase solution (20 mM MgCl 2 , 2 mM DTT, 0.1 units DNase, 

20 0.6 units RNase inhibitor in TE) to remove contaminating 
DNA. The RNA was extracted with 

phenol/chlorof orm/ isoamyl , and ethanol precipitated. 
After quantitation at 260 nm, an aliquot was 
electrophoresed to check the integrity. A total of 3 20/ig 

2 5 of total RNA was purified. 

Poly A+ RNA was isolated using an Oligotex-dT kit 
(catalog # 7 004 2) from Qiagen (Chatsworth, CA) as 
described by the manufacturer. After quantitation, the 
mRNA was ethanol precipitated and resuspended at lmg/ml 

30 in distilled, deionized, DEPC-treated water. A total of 
llAtg of poly A+ RNA was purified. 

Library Construction . cDNA was synthesized 
according to the method of Gubler and Hoffman (Gene, 
1983, 2J5:263) using a Superscript Plasmid cDNA synthesis 
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10 



kit (Catalog # Series 8248) purchased from Life 
Technologies (Gaithersburg , MD) . The cDNA obtained was 
ligated into the Notl/Sal I sites of the mammalian 
expression vector pMET7 , a modified version of pMEISS, 
which utilizes the SRa promoter as described previously 
(Takebe, Y. et al., 1988, Mol. Cel. Bio. 8:466). This 
vector was chosen because it contains a strong eukaryotic 
promoter, is expressed in COS7 cells, contains the AMP 
resistance gene, and is only 3.0 kb in length. The small 
size of the vector is important because it increases the 
probability of cloning large cDNAs. other comparable 
vectors are 4.8 kb and larger, thereby increasing the 
chances of imperfect replication, and reducing the 
probability of cloning large cDNAs. Ligated cDNA was 
15 ethanol precipitated and resuspended in distilled, 

deionized, DEPC-treated water at 2 5 ng/ml. One nl of the 
DMA was transformed by electroporation per 40 (il °f 
electrocompetent DH10B fi. cojj in a 0 . 1 cm cuvette. 

cDNA was synthesized twice and used to construct 
two independent mouse choroid plexus libraries: 
roCP (mouse choroid plexus) A and mCP D. 

r»«a Pr BP aration. Based on titers of the cDNA 
transformations, 96-deepwell plates were inoculated with 
150 cfu/well of primary transf ormants in 1 ml of LB-amp. 
Primary transf ormants grown only 1 hour at 3 7«C prior to 
aliquoting were used to avoid the overgrowth of smaller 
insert clones and thus underrepresentation of larger 
clones in the 150 cfu pools. Cultures were grown 15-16 
hours at 37-C with aeration. Prior to prepping, 100 M 1 of 
cell suspension was removed and added to 100^1 of 50% 
glycerol, mixed and stored at -80°C (glycerol freeze 
plate) . 

DNA was prepared using the Wizard" Minipreps DNA 
Purification Systems (Promega, Madison, WI; Catalog No. 



20 



30 
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A7100) employing modifications for a 96-well format. The 
protocol was as follows: 

1) Cultures were centrifuged in 96-deepwell 
plate at 3200 rpm for 10 minutes at 4°C. 

5 Supernatants were removed. 

2) 140/xl each of cell resuspension solution 
(50 mM Tris-HCl, pH 7.5, 10 mM EDTA, 100 Mg/ml 
RNase A) , cell lysis solution (0.2 M NaOH; 1.0% 
SDS) and neutralization solution (1.32 M Potassium 

.0 acetate, pH 4.8) were added, in order, with 

vortexing 14 seconds after addition of each 
reagent, to ensure good mixing. 

3) Plates were placed in ice water for 15 
minutes . 

L5 4) Samples were centrifuged at 3 200 rpm for 

10 minutes at 4°C. 

5) supernatants were transferred to 96-well 
Polyf iltronics polypropylene filterplate (10 
micron, 0. 8ml) - 

20 6) 500/xl WP resin were added and incubated 

3-5 minutes at RT; suction was applied to plate. 

7) Samples were washed three times with 
640jxl of the resuspension solution. 

8) Samples were centrifuged at 3 200 rpm for 
25 5 minutes at RT, to remove residual buffer. 

9) samples were eluted 2-5 minutes with 40^1 
room temperature water. 

10) Eluted DNA was centrifuged through to 
microwell plate at 3 2 00 rpm for 5 minutes at room 

3 0 temperature. 

11) DNA was quant itated. 

Pooling strategy . The pooling strategy was 
devised to provide optimal sized pools, 1200 cfu, for 
transfection and detection, and quick breakdown to the 
35 smaller pools of 150. Once a positive pool of 150 was 
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identified, between 400 to 800 individual clones were 
needed to provide representation of the pool. Using a 
single pool of 1200 cfu initially would have meant fewer 
DNA probes but would have required the use of more 
5 individual clones (3200-6400) in the final identification 
step thereby requiring significantly more time to 
identify a positive clone. 

DNAs totalling 5/ig were pooled equally from eight 
wells, one column, to give a total of 1200 cfu. Thus, 
10 each 96-well plate gave rise to 12 pooled DNAs for 
transfection into COS-7 cells. 

When a positive pool was identified, DNA was 
prepared from each of the eight wells constituting the 
pool and retransfected into COS-7 cells. When a positive 
15 well was identified, the well was broken down by plating 
out an aliquot of the glycerol freeze of that well such 
that several thousand individual colonies were obtained. 
For each positive well, between 400 and 800 colonies were 
picked and arrayed in a 96-well format, DNA was obtained, 
20 as described above, and the DNA from 24 wells was pooled 
for transfection. DNA representing each individual clone 
from a positive row was isolated and transfected for 
final identification. 

Quantitative Ob cell surf ace binding analysis. 
25 Quantitative cell surface binding assays with AP-Ob 

fusion proteins were performed essentially as described 
previously for Kit-AP (Flanagan, J-G. and Leder, P. , 
1990, Cell 63:185-194. 

Ob Protein . The recombinant murine Ob protein 
30 used herein has been described previously (Campfield et 
al., 1995, Science 269:546-549). The recombinant human 
Ob protein used herein was purified from Baculovirus 
supernatants with a monoclonal antibody column containing 
monoclonal antibody directed against human Ob. The 
3 5 purified recombinant human Ob protein was judged by 
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standard Coomasie blue staining to be greater than 95% 
pure. 

DHA Se quencing , Sequencing and sequence assembly 
were performed as described previously (International 
5 Polycystic Kidney Consortium, 1995, Cell 8_l: 289-298) . 

Northern Analysis , Northern blot analysis of poly 
A + mRNA from various tissues (Clontech) was probed, using 
standard techniques (Chirgwin, J.M. et al. 1979, 
Biochemistry IB.* 5294-5299) , with labeled DNA amplified 
10 from sequences encoding the murine ObR extracellular 
domain, 

rt-PCR . Reverse transcription PCR (rt-PCR) 
reactions were performed on 1 fig total RNA utilizing 
standard techniques (Zhang, Y . et al., 1994, Nature 
15 372 : 425*432) . Specifically, first strand cDNA was 

prepared using random hexamers. The first strand cDNA 
was then PCR amplified using primers derived from 
sequences encoding the ObR extracellular domain or G3PDH 
contro 1 primers . 

20 B.. RESULTS 

1 . . CLONING OF THE OB RECEPTOR 

FROM MOUSE CHOROID PLEXUS 

The strong, Ob-specific binding of the AP-Ob 

fusion protein to the murine choroid plexus described 

2 5 above, in the Example presented in Section 6, suggested 

that an Ob receptor could be expressed at high levels 
within this tissue. In order to attempt to clone a cDNA 
encoding the Ob receptor, therefore, the choroid plexuses 
from 3 00 mice were dissected, and a total of 11 jig poly 

3 0 A + RNA was isolated from the tissue to be used to 

construct cDNA libraries as described above, in Section 
7.1. 

Initially, 3 fig poly A + were used to generate 
cDNA, to be used in constructing mouse choroid plexus 
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cDNA library A. All of the cDNAs generated which were 
greater than 500 bp in size (261 ng) were pooled and 90 
ng were ligated to pMET7 . Transformation of this ligated 
cDNA into electrocompetent DH10B E. coli resulted in a 
5 library of approximately 7.2 x 10 5 cfu, with an average 

size of 1 kb. 

Recognizing that cDNA library A did not contain a 
sufficient number of clones containing inserts large 
enough to encode a receptor at a statistically reasonable 
10 frequency, a second 3 M9 of poly A + RNA was used to 

generate 758 ng of cDNA. 32 ng of cDNA representing the 
largest two fractions of cDNA were pooled and ligated 
into pMET7. Transformation of these ligated cDNA 
molecules resulted in mouse choroid plexus library D, 
15 with 2.4 x 10 5 cfu and an average insert size of 2 kb. 

Using only the largest two fractions of cDNA ensured that 
the library would be biased towards large cDNAs. This was 
confirmed by characterizing the insert sizes of ten 
clones; seven clones had inserts greater than 2 kb in 
2 0 length and no clones were seen with inserts smaller than 
1 kb. This was in contrast to the library A where 16 out 
of 2 0 clones were smaller than lkb. 

DNA representing 6 x 10 5 cfu (40 plates) was 
prepared and pooled from the mouse choroid plexus library 

2 5 A. DNA representing 2.4 x 10 5 cfu (16 plates) was 

prepared from mouse choroid plexus library D. 

For screening purposes, the libraries were 
produced as pools of 150 clones, with a mixture of 8 
pools being used in each transfection ( i.e. , 1200 

3 0 clones/transfection) . Pooled DNA was transiently 

transfected into COS-7 cells, and the cells were screened 
by incubation with supernatants containing the murine AP- 
Ob fusion protein, washed, and stained for AP activity in 
situ, all as described, above, in Sections 6.1 and 6.2. 
35 Once a positive pool was identified, the 8 individual 
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subpools were each tested separately, and the resulting 
positive subpool was further subdivided until a single 
positive clone was identified. 

A total of 632 DNA pools were derived from 
5 libraries A and D, with a total of 10 independent 

positive pools being identified. All of these positive 
pools were successfully broken down into subpools of 150 
clones each, and one positive subpool was further 
subdivided until a single positive clone was identified. 
10 The clone, which contained a 5.1 kB cDNA insert, was 
designated f am j 53 12 . 

2. . THE Ob RECEPTOR (ObR) AND ObR GENE 

The famj5312 murine obR cDNA clone isolated, as 
described above, in Section 7.2.1, contained an insert of 
15 approximately 5.1 kb. The nucleotide sequence obtained 
from this clone is depicted in FIG. 1 (SEQ ID NO:l). The 
nucleotide sequence of the clone revealed a single open 
reading frame, the ObR derived amino acid sequence of 
which is also depicted in FIG. 1 (SEQ ID NO:2). 
0 The deduced 894 amino acid sequence of the murine 

ObR protein begins with a methionine whose codon is 
within a DNA sequence that is consistent with a 
translation initiation site. The ObR amino acid sequence 
begins with a hydrophobic signal sequence from amino acid 
5 residues 1-23, typical of proteins which are to be either 
membrane-associated or secreted. 

The murine Ob receptor protein contains a single 
hydrophobic transmembrane domain from amino acid residues 
838-860, indicating that the Ob receptor spans the cell 
10 membrane once. 

The position of the transmembrane domain indicates 
that the extracellular portion of the mature murine ObR 
protein spans from amino acid residue 24 to amino acid 
residue 837. A database search reveals that the 
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extracellular domain of ObR contains regions of homology 
which place ObR into the Class I family of cytokine 
receptors (for reviews, see, e.g. . Heldin, C-H., 1995, 
Cell 80:213-223; and Kishimoto, T. and Tetsuya, T. , 1994, 
5 Cell 76:253-252). ObR appears to be most closely related 
to the gpl3 0 signal transducing component of the IL-6 
receptor, the GSF receptor and the LIF receptor. 
Alignment studies of ObR and gpl30 amino acid sequences 
revealed that, although the overall sequence identity 
10 between the two proteins is low, the characteristic 

conserved cysteine residues, the Trp-Ser-X-Trp-Ser motif, 
and other amino acid residues conserved within the class 
I family of proteins are clearly evident. 

Following the single transmembrane domain, the 
15 murine Obr protein contains a short cytoplasmic domain of 
34 amino acids f i.e. . amino acid residues 861-894). 
Homology comparisons also reveal that the first twenty 
three amino acids of the ObR cytoplasmic domain show a 
30% identity to membrane proximal sequences of the LIF 

20 receptor. 

Reverse transcription PCR amplification of obE 
mRNA from total RNA confirmed the presence of obR 
transcript (a single band of about 5kb) in choroid 
plexus, and also demonstrated its presence in 

25 hypothalamus. Further, Northern blot analysis of poly A + 
RNA derived from several mouse tissues revealed that obR 
mRNA is present in additional tissues, such as lung and 
kidney. 

3. . THE OB RECEPTOR STRONGLY BINDS OB PROTEIN 

30 An analysis of the binding of AP-Ob to the ObR 

encoded by the obR cDNA described above, in Section 
7.2.2, was conducted. The results of this analysis, 
depicted in FIG. 2, demonstrate that the ObR exhibits 
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strong, Ob-specific binding to both mouse and human Ob 
protein. 

A quantitative analysis of the binding of the AP 
fusion proteins is shown in FIG . 2. After transient 

5 transfection of the ObR clone into cos cells, strong 
binding of l nM murine AP-Ob is detected (relative to 
mock transfected COS cells or ObR transfected COS cells 
incubated with unfused AP) (FIG. 2A) . This binding is 
nearly completely inhibited by 100 nM untagged 

.0 recombinant mouse or human leptin protein, demonstrating 
that this receptor can bind native Ob. A fusion between 
AP and human Ob also binds mouse ObR with high affinity, 
as does a fusion protein with mouse leptin at the N- 
terminus and AP at the C-terminus (Ob-AP) . Scatchard 

5 analysis of the binding of mouse AP-Ob (FIG. 2B) produced 
a value for the dissociation constant (K D ) of 0.7 x 10~ 9 
M. 



4 - • AUTHENTICITY OF THE f»m j5312 CT.OWP! 

The authenticity of the isolated obR famj53i2 
20 clone was tested in several ways. First, 8 independently 
isolated clones (in subpools of 150 clones each) were PCR 
amplified with primers made to obR sequences 3' of the 
stop codon. Sequencing verified that all 8 clones 
contained the same 3' untranslated sequences. In 

2 5 addition, the regions of 5 independently isolated clones 

encoding the ObR C-terminus were sequenced and each was 
shown to utilize the same stop codon. Finally, reverse 
transcription PCR (rt-PCR) of choroid plexus total RNA 
isolated from a different mouse strain (C57/BLKsJ) than 

3 0 that from which the cDNA libraries were derived generated 

an identical PCR product containing a stop codon in the 
same location. These data indicated that the isolated 
famj5312 cDNA clone was neither a chimeric clone nor was 
it the result of a rare aberrant splicing event, but, 
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rather, represents a clone which encodes the predominant 
form of the ObR receptor in the choroid plexus. 

5 . . CLONING HOUSE LONG FORM ObR 

ENCODING NUCLEIC ACIDS 

5 As described herein, we have cloned the murine ObR 

long form. 

In order to find the mouse homolog of the human 
long form of the obR gene (FIG. 3) , semi-nested PCR was 
performed on first strand cDNA isolated from mouse 

10 hypothalamus, Ks, and choroid plexus, db and Ks, with 5' 
primers from the region just before mouse short form 
starts to diverge from the human long form, and 3' 
degenerate primers designed from the human ObR homolog 
intracellular region. The complete transcript was 

15 further characterized by 3' RACE. 

Total mRNA was prepared from C57B1/KS (KS) and 
C57Bl/KS-db (db) choroid plexus and hypothalamus. cDNA 
was reverse-transcribed from 1 jig of cDNA of mRNA using 
random hexamer or oligo dT as primer with Superscript 

20 Reverse Transcriptase from GIBCO-BRL. A total 24 /xg of 
cDNA was made. For PCR, cDNA was diluted 1:200 and 3 fig 
of the diluted cDNA was used in a 25 fil reaction. 

The first round of PCR reactions used a 5' primer 
encoding the mouse ObR protein sequence PNPKNCSW, and 

25 consisting of nucleotides 5 ' -CCA AAC CCC AAG AAT TGT TCC 
TGG-3 ' , and a reverse degenerate primer complementary to 
the nucleotide sequence encoding KIMENKMCD, adjacent to 
the carboxy terminus of the human long form and 
consisting of nucleotides 5'-TC (GA)CA CAT (CT)TT (GA)TT 

3 0 (GATC)CC CAT TAT CTT-3 ' . 

For the second round of PCR reactions, the 3 ' 
primer was the same, and the 5' primer, which was 
internal to the previous 5' primer, encoded the mouse ObR 
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protein sequence AQGLNFQK , and consisted of nucleotides 
5 '-GCA CAA GGA CTG AAT TTC CAA AAG-3'. 

PGR reactions were carried out as described above 
except the nested PCR profile was 94 «C for 3 minutes- 
5 94«C for 30 seconds, 57° for 30 seconds, 72°C for 4 0 ' 
seconds for 30 cycles; 72 -c for 5 minutes for one cycle. 

DNA sequencing was performed on the automatic ABI 
3 73A and 377 DNA sequencer by using the Taq cycle 
sequencing kit (Applied Biosystems, Foster City, CA) . 
10 Sequence analysis was performed using Sequencher. 

Semi-nested PCR of the nucleic acids encoding the 
intracellular domain of murine long form ObR was also 
performed on mRNA isolated from hypothalamus in order to 
obtain sufficient quantities of a specific PCR product 
15 encoding the mouse long form of obR gene. Sequencing of 
the PCR product (FIG. 6) confirmed that this DNA encodes 
the mouse homolog of the long form of ObR. The 
transcripts of the short and long forms are identical 
until the fifth codon 5' of the stop codon of the short 
20 form and then diverge completely, suggestive of 

alternative splicing. The deduced amino acid sequences 
from mouse long form and the human ObR are homologous 
throughout the length of the coding region and share 75% 
identity (FIG . 7). 

25 6 -« EXPRESSION PROFILE OF ObR mRNA 

As a first step in understanding the tissue 
distribution of ObR, the expression of its mRNA was 
examined in various murine tissues. To this end, 
Northern blot analysis of poly a+ mRNA (2 ^g/lane) 

30 derived from various mouse tissues (heart, brain, spleen, 
lung, liver, skeletal muscle, kidney and testes; 
Clontech, Palo Alto, CA) was probed with labelled DNA 
amplified from sequences encoding the ObR extracellular 
domain. Hybridizations were done in Rapid-hyb buffer 
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(Amersham) at 65°C following the manufacturer's 
instructions . 

In most tissues, the obR mRNA appears as a single 
band slightly larger than 5kb, indicating that the S.ikb 
5 cDNA clones described herein are full-length- of the 

tissues assayed, expression was seen in lung, kidney, and 
total brain. No expression was detected in testes. 

RT-PCR amplification of the obR mRNA from total 
RNA confirmed the presence of this transcript in choroid 

10 plexus and also demonstrated its presence in 

hypothalamus. The RT-PCR reactions were performed on l/ig 
total RNA isolated from mouse choroid plexus or 
hypothalamus. Tissues were isolated from db/db mice 
(C57Bl/BLKsJ background) or +/+ littermate controls. 

15 First strand cDNA, prepared using random hexamers, was 
PCR amplified using primers derived from sequences 
encoding the ObR extracellular domain or G3PDH control 
primers. No bands were detected from the amplification 
of mock reverse-transcribed total RNA controls run in 

2 0 parallel. 

VIII.. EXAMPLE: THE ObR GENE IS THE db GEira 
The experiments and studies described below 
demonstrate that the obR gene maps to the db locus, and 
that the obR gene in db mice is a mutant form of obR that 
2 5 results in transcription of an aberrantly spliced mRNA 
having a 106 nucleotide insert resulting in a truncated 
long form murine ObR protein that is identical to murine 
short form ObR. 

A. . THE obR GENE MAPS WITHIN 
30 THE db GENETIC INTERVAL 

In the Example presented herein, studies are 

described which indicate that the obR gene maps to a 4 to 

5 cM region on mouse chromosome 4 which represents the 

same region to which the db locus maps. 
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1. • MATERIALS AND METHODS 
PCR Amplification . The following f am j 53 12 -derived 
primers were used for amplification of mouse genomic DNA : 

forward primer : 5 ' -GCTGCACTTAACCTGGC-3 ' 
5 reverse primer: 5 ' -GGATAACTCAGGAACG-3 ' . 

The PCR reaction mixture contained 6/xl template 
DNA (lOng/^1) , 1.4 pi 10x Perkin Elmer (Norwalk, CT) PCR 
buffer, 1.12/il dNTPs (2.5 mM) , 1.05 fil Forward primer 
(6.6 M**), 1.05 Ml Reverse primer (6.6 MM) , 0.38 m! H 2 0 
10 and 3 pi AmpliTaq Hotstart™ polymerase (Perkin Elmer; 
O.SU/mI) - 

The amplification profile was as follows: 94° C, 
2 minutes, at which point the ampliTaq was added, then 3 0 
cycles of 94° C, 4 0 seconds, 55° C, 50 seconds and 72° C, 
15 3 0 seconds. 

A second set of primers were utilized under the 
same conditions except that the 55 °C cycle was conducted 
at 52 °C: 

f orward primer : 5 ' - C ACT ATTTG C C CTT C AG — 3 ' 
20 reverse primer : 5 ' -GCCTGAGATAGGGGTGC-3 ' 

Electrophoresis . Samples were run on both 
nondenaturing 8% acrylamide gels run at 45 W, room 
temperature, for 3 hours and nondenaturing 10% acrylamide 
SSCP (single stranded conformational polymorphism) gels 
25 run at 20 W, 4° C, for 2.5 hours. 

Both types of gels were stained with SYBR Green I 
and scanned on an MD Fluor imager, and gave interpretable 
results . 
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2. . MAPPING OF THE fami5312 ObR cDNA CLOUR 

PCR primers were designed from the coding sequence 
of famj5312 cDNA, as described in Section 8.1. These 
primers amplified a 19 2 bp fragment from C57B1/6J genomic 
5 DNA, consistent with the base pair length between the two 
primers in the obR cDNA, and a 195 bp fragment from the 
wild-type derived Mus spretus strain SPRET/Ei. The 3 bp 
insertion in the Mus spretus allele codes for an 
additional Asn between amino acids #4 5 and #46. The 

10 genetic segregation of the Mus spretus 195pb allele of 
ObR was followed in 182 backcross progeny of the cross 
(C57B1/6J x Mus spretus) F x females x C57B1/6J males by 
both Single Stranded Conformational Polymorphism (SSCP) 
gel electrophesis and nondenaturing gel electrophoresis 

15 for size determination. The segregation pattern of the 
Mus spretus allele was compared to the segregation 
pattern of 22 6 other genetic loci that have been mapped 
in this backcross panel. By minimizing the number of 
multiple crossovers between obR and other markers it was 

20 determined that obR maps to murine chromosome 4, 

approximately 2.2±1.6 cM distal to the marker D4Mit9 and 
4.6±1.6cM proximal of the marker D4Mit46. The genetic 
map position of obR was further refined by mapping 
additional genetic markers. The obR gene maps 0.6+0. 6cM 

25 distal from D4Mit255 and 0.6±0.6cM proximal of D4Mitl55;. 
see FIG. 8. 

Additional primer pairs were designed (forward = 
CACTATTTGCCCTTCAG ; reverse = GCCTGAGATAGGGGTGC) from the 
3' sequence of famj5312 cDNA which also revealed a 

30 polymorphism on SSCP gels between C57B1/6J genomic DNA 

and that of the wild derived Mus spretus strain SPRET/Ei. 
Again this permitted the genetic mapping of famj5312 
cDNA, now using a different fragment of the clone. The 
mapping of this polymorphism was 100% concordant with the 

35 mapping of famj5312 reported above, both confirming the 
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mapping of obR and indicating that the famj53X2 cDNA 
clone was not chimeric. 



3. . DEFINITION OF THE MURINE db GENETIC REGION 

The mouse db gene was originally mapped to mouse 
5 chromosome 4 (Hummel, K.-P. et al., 1966, Science 
153 : 1127-1128) . This genetic localization has been 
refined (Bahary, N . et al,, 1990, Proc. Natl, Acad. Sci. 
USA 87:8642-8646; Bahary, N. et al., 1993, Genomics 
i6: 113-122) to place db within a genetic interval of 

10 l . 5cM between the proximal Ornithine decarboxylase 4 

( Odc4 ) locus and the anonymous distal markers D4Rck22 and 
D4Rck69. Bahary et al. 1993 also report D4Mit205 as 
being l.lcM proximal to Qdc4. Hence, relative to 
D4Mit205, the db gene was mapped approximately 2.2cM 

15 distal. 

The db allele originally arose on the C57Bl/BLKsJ 
inbred strain. The db mutation has subsequently been 
transferred to other genetic backgrounds to form congenic 
strains. By typing animals of the congenic strain 

20 C57Bl/6J-m db it was possible to define the genetic 

interval within which the db gene had to reside on mouse 
chromosome 4. By this analysis, the interval that must 
contain the db gene was defined as the approximate 4cM 
between the proximal anonymous DNA marker D4Mit255 and 

25 the distal markers D4Mit3 31 and D4Mit31. (Genetic 

distance as defined on the Mit map; Dietrich, W.F. et 
al., 1994, Nature Genetics 2-220-245; Copeland, N.G. et 
al., 1993, Science 262:67; Whitehead Institute/MIT Center 
for Genome Research, Genetic Map of the Mouse, Database 

30 Release 10, April 28, 1995). It should be noted that the 
interval defined by Bahary et al- 1993, supra . appears to 
be a few centimorgans proximal of the region as defined 
herein. See FIG. 8, in which the distance between 
D4Mit2 55 and D4Mit31 is about 5.1cm, 
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By comparing the mapping data for famj5312 with 
the db mapping data described above, the map position of 
famj5312, 0.6±0.6cM distal from D4Mit255 and 0.6±0.6cM 
proximal of D4Mitl55, is in complete accordance with obR 
5 being the db gene. 

B.. THE ObR MUTATION IN db MICE RESULTS 
IN A TRUNCATED LONG FORM RECEPTOR 

1. . MATERIALS AND METHODS 

Total mRNA was prepared from C57B1/KS (KS) and 
10 C57Bl/KS-db (db) choroid plexus and hypothalamus* cDNA 
was reverse-transcribed from 1 nq of cDNA of mRNA using 
random hexamer or oligo dT as primer with Superscript 
Reverse Transcriptase from GIBCO-BRL. A total 24 ug of 
cDNA was made. For PCR f cDNA was diluted 1:200 and 3 /xg 
15 of the diluted cDNA was used in a 25 ^1 reaction. 

From the mouse short form cDNA clone, famj5312, 
and the long form cDNA clone (FIG. 6) , primers were 
designed covering the entire coding region of both the 
short and long forms of obR cDNA. Overlapping PCR 
20 fragments with an average size of 600 bp were generated 
from each sample. PCR products were electrophoresed on 
an 0.8% low melting agarose gel. DNA was isolated from 
the gel and agarased. Agarased DNA fragments were 
sequenced with both end primers as well as internal 

2 5 primers. 

PCR Conditions . The 25 Ml PCR reaction contained 
2 mM MgCl 2 , 0.5 mM of each primer, 200 mM each of dATP , 
dTTP , dCTP and dGTP , and 0.5 units of Taq polymerase in 
IX Taq polymerase buffer (Perkin-Elmer ) . All PCR 

3 0 reactions were performed in the GeneAmp PCR System 9600 

(Perkin-Elmer) . Unless otherwise described, the general 
PCR profile was 94 °C for 3 minutes; 94 °C for 10 seconds, 
57°C for 10 seconds, 72 °C for 40 seconds for 35 cycles 
and 72 °C for 5 minutes for one cycle. 
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DNA sequencing and Sequence Analysis . DNA 

sequencing was performed on the automatic ABI 373A and 
377 DNA sequencer by using the Taq cycle sequencing kit 
(Applied Biosystems, Foster City, CA) • Sequence analysis 
5 was performed using Sequencher. 

2. . RESULTS 

Semi-nested PGR was performed on mRNA isolated 
from choroid plexuses of KS and cib mice. The PCR product 
generated using the sife cDNA as template was approximately 

10 100 bp longer than that using Ks DNA as template. The 
PCR products from both were directly sequenced. No 
sequence difference was detected within the coding 
sequence of the short form of the mRNA species expressed 
in the choroid plexus of these mice. However, upon the 

15 sequencing of the PCR product that was generated starting 
from the transmembrane domain shared by the two forms and 
ending in the intracellular domain specific for the long 
form, we noticed an apparent difference between db/db and 
control in several tissues. The sequencing data showed 

20 that the putative db long form of obR has an additional 
106 bp insertion in the normal long form transcript (FIG. 
9). This 106 bp includes sequence encoding the last five 
amino acids, stop codon as well as 88 bp 3' UTR region of 
the short form- The db long form produces a truncated , 

25 ObR protein identical to the short form which lacks the 
intracellular domain. We did not detect the normal long 
form in any db tissues, nor the db long form in control 
tissues . 

To understand the mechanism of this apparent 
30 splicing error, we compared the obR genomic sequence 

between the db/db and control mice. A single nucleotide 
change of G— T was discovered 2 bp immediately after the 
106 bp insertion site in db/db mice. This change creates 
a splice donor which converts the 106 bp fragment to an 
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exon inserted in the db long form. Because of this 
insertion, the db long form produces only a truncated 
protein which does not have the intracellular signal 
domain. Since the class I cytokine receptors to which 
5 ObR is most closely related all have a long intercellular 
domain, the long intercellular domain of the long form is 
crucial for initiating intracellular signal transduction. 
These data support the role of this receptor in weight 
modulation, and the failure to produce ObR long form as 
10 cause of the severe obese phenotype in db/db mice. 



IX. . EXAMPLE; CLONING HUMAN ObR EKCODING NUCLEIC ACIDS 

Described herein is the cloning and identification 
of cDNA and gemonic DNA which encode human obR. 

A. . CLONING THE HUMAN Obr cDNA 

15 The famj5312 cDNA insert was used to probe a human 

fetal brain cDNA library in the Uni-Zap XR vector 
obtained from Sratagene (La Jolla, CA) . A cDNA library 
derived from a human fetal brain was chosen because of 
the likelihood that this library would contain cDNAs 

2 0 present in the entire brain, including the choroid 

plexus, the tissue source of the mouse obR cDNA, as well 
as cDNAs present in the hypothalamus. 

The cDNA library was plated on 20 plates with 
approximately 50,000 pfu/plate. Duplicated filter lifts 

2 5 were done on each plate with Amersham Hybond-N nylon 
membrane filters. The filters were denatured, 
neutralized and cross-linked according to standard 
procedures. The probe was radioactively labelled by 
random priming in the presence of 32 P labelled nucleotide. 

30 The filters were hybridized with probe overnight at 65°C 
in Church's buffer (7% SDS, 250mM NaHP04 , 2^M EDTA, 1% 
BSA) . The next day, filters were washed in 2XSSC/0.1% 
SDS for 20 min at 65°C, then in 0 . 1XSSC/0 . 1%SDS for 10 
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min. They were then exposed to Kodak film at -80 °C for 5 
hours . 

After matching up duplicated filters , 13 
duplicated signals were obtained. Secondary plating was 
5 followed by plating out 10^1 of 1:1000 dilution of each 
primary plug. The same probe, hybridization and wash 
conditions were used as above. Film was exposed at 80°C 
for 2 hours. Only 1 of the 13 original positives gave 
duplicated signals on the film. 

10 Four independent plaques from the positive plate 

were processed and excised with ExAssist helper phage, 
XLl-Blue cells and SOLR cells as described by Stratagene. 
Excision products were then plated out on LB /Amp plates 
and incubated at 37 °C overnight. One white colony was 

15 picked up from each plate and grown in liquid LB/ Amp at 
37 °C overnight. The next day mini preps were done with 
the Promega Wizard Mini-prep kit. The sizes of the 
inserts were determined by digesting the mini-prep 
products with EcoRl and Xhol. One of the four clones (d) 

20 has an insert of approximately 6kb. 

DNA for sequencing was prepared using a Qiagen 
Plasmid Maxi kit. 

FIG, 3 depicts the nucleotide sequence (SEQ. ID. 
No: 3) of human obR cDNA encoding the signal sequence 

25 (amino acid residues 1 to about 20) , extracellular domain 
(from about amino acid residue 21 to about 83 9) , 
transmembrane domain (from about amino acid residue 84 0 
to about 862) and cytoplasmic domain (from about amino 
acid residue 863 to 1165) . 

30 B.. CLONING HUMAN obi? GENOMIC DNA 

As described herein, we have cloned human obR 
genomic DNA. 

The famj53l2 cDNA insert was used to probe human 
high density PAC filters purchased from Genome Systems 
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labelled using the Prime-It kit (Stratagene; Catalog No. 
30O392) . The hybridization was carried out in Amersham 
Rapid-hyb buffer according to the manufacturer's 
5 recommendations. The filters were then washed in 

2XSSC/1% SDS at 65°C and exposed to Kodak film at -80°C. 

Eleven putative positive PAC clones were identified. 
Their grid position was determined and the clones were 
purchased from Genome Systems, Inc. 
10 The clone at grid position P298-K6, which we have 

designated hobr-p87, was further validated as containing 
the entire ObR coding region by PCR testing with primer 
pairs from the 5'(obRF4 and obRR4) and 3' (obRS and obRO) 
ends of the obR open reading frame. The primers used in 
15 this validation were as follows: 

obR F4 : 5'- CTGCCTGAAGTGTTAGAAGA -3' 

obRR4 : 5'- G CTG AACTG ACATT AG AGGTG -3' 

obR S: 5'- ACCTATGAGGACGAAAGCCAGAGAC -3 ' 

ObR O: 5'- TGTGAGCAACTGTCCTCGAGAACT -3' 

2 0 The hobr-p87 clone was deposited with the ATCC on 

December 28 , 1995. 



X.. EXAMPLE: CONSTRUCTION OF ObR 
IMMUNOGLOBULIN FUSION PROTEINS 

A. . PREPARATION OF OBR-IG FUSION PROTEINS 

25 The extracellular portion of human ObR is prepared 

as a fusion protein coupled to an immunoglobulin constant 
region. The immunoglobulin constant region may contain 
genetic modifications including those which reduce or 
eliminate effector activity inherent in the 

3 0 immunoglobulin structure. (See, e.g. . PCT Publication No. 
WO88/07089, published September, 22, 1988). Briefly, PCR 
overlap extension is applied to join DNA encoding the 
extracellular portion of human ObR to DNA encoding the 



WO 97/19952 



PCT/US96/19128 



- 130 - 

hinge, CH2 and CH3 regions of human igGl. This is 
accomplished as described in the following subsections. 



B. • PREPARATION OF GENE FUSIONS 

PGR reactions are prepared in 100 nl final volume 
5 composed of Pfu polymerase and buffer (Stratagene) 

containing primers (1 /iM each), dNTPs (200 /xM each), and 
ing of template DNA. 

DNA fragments corresponding to the DNA sequences 
encoding the ObR ECD, or a portion thereof that binds Ob, 
10 are prepared by polymerase chain reaction (PGR) using 

primer pairs designed so as to amplify sequences encoding 
the entire human ObR ECD as well as a small amount of 5' 
noncoding sequence. For example, the forward primer: 
5 ' -GTCACGATGTCGACGTGTACTTCTCTGAAGTAAGATGATTTG-3 ' 
15 corresponds to nucleotides -20 to +8 in FIG. 3 with an 

additional 14 nucleotides (containing a Sail site) at the 
5' terminus. The reverse primer: 
5 ' - 

GTCAGGTCAGAAAAGCTTATCACTCTGTGTTTTTCAATATCATCTTGAGTGA 
20 A-3' 

corresponds to the complement of nucleotides +2482 to 
+2517 in FIG. 3, with an additional 18 nucleotides 
(containing a Hindlll site) at the 5' terminus. A cDNA 
encoding human ObR serves as the template for amplifying 

2 5 the extracellular domain. PCR amplification with these 
primers generates a DNA fragment which encodes ObR 
extracellular domain. 

In a second PCR reaction, a second set of primers 
are designed to amplify the IgG constant region (i.e., 

30 the hinge, CH2 , and CH3 , domains) such that the reverse 
primer has an unique restriction site and the sequence of 
the forward primer has a 5' terminus that is 
complementary to the 5' terminal region of the reverse 
primer used in the ObR ECD amplification, supra (i.e., 
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5 ' -AAGCTTTTCTGACCTGACNNN -3') and which will enable the 
open reading frame in the ObR encoding nucleotide 
sequence to continue throughout the length of the IgG 
nucleotide sequence to be amplified. The template DNA in 
5 this reaction is the 2 000 nucleotide segment of human IgG 
heavy chain genomic DNA (Ellison et al. , 1982, Nuc. 
Acids . Res 10:4071-4079). 

The complete human pbR-IgG fusion segment is 
prepared by an additional PCR reaction. The purified 

10 products of the two PCR reactions above are mixed, 

denatured (95°C, 1 minute) and then renatured (54°C, 30 
seconds) to allow complementary ends of the two fragments 
to anneal. The strands are filled in using dNTPs and Taq 
polymerase and the entire fragment amplified using 

15 forward PCR primer of the first PCR reaction and the 
reverse PCR primer of the second PCR reaction. For 
convenience of cloning into the expression vector, the 
resulting fragment is then cleaved with restriction 
enzymes which recognize unique designed sites in the 

2 0 forward PCR primer of the first PCR reaction and the 

reverse PCR primer of the second PCR reaction. This 
digested fragment is then cloned into an expression 
vector that has also been treated with these restriction 
enzymes . 

25 Sequence analysis is used to confirm structure and 

the construct is used to transfect COS cells to test 
transient expression. 

Those skilled in the art are aware of various 
considerations which influence the choice of expression 

30 vector into which the obR -laG fusion segment is to be 

cloned, such as the identity of the host organism and the 
presence of elements necessary for achieving desired 
transcriptional and translational control. For example, 
if transient expression is desired, the obR -laG fusion 

3 5 segment generated supra can be cloned into the expression 
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vector pcDNA-l (Invitrogen) . Alternatively, stable 
expression of the fusion protein can be achieved by 
cloning the obR-IgG fusion segment into the expression 
vector pcDNA-3 (Invitrogen) . 
5 Alternatively, mouse and/or human obR -IaG fusion 

proteins can be generated using an expression vector such 
as the CD5-IgGl vector (described by Aruffo et al. , 1990, 
Cell, 61: 1303-1313), which already contains the IgG 
constant region. According to this method, the DNA 

10 fragment encoding the ObR extracellular domain is 

generated in a PCR reaction so that the open reading 
frame encoding the ObR extracellular domain is continuous 
and in frame with that encoding the IgG constant region. 
For example, the extracellular domains (including 

15 signal peptides) of mouse and human ObR were PCR 

amplified with Extaq (PanVera Corp.)* The following 
primers were used for amplification of mouse and human 
ObR in first generation expression constructs: 

Mouse 

20 Forward primer: 5 ' -CCCAATGTCGACATGATGTGTCAGAAATTCTAT- 

3' 

Reverse primer: 5 ' -AAAAAGGATCCGGTCATTCTGCTGCTTGT COAT- 

S' 

Human 

25 Forward primer: 5 ' -CCCAATGTCGACATGGTGTACTTCTCTGAAGTA- 

3' 

Reverse primer : 5 ' -TTTTTGGATCCCACCTGCATCACTCTGGTG— 3 ' 

Each forward primer above contains a Sal I 
restriction site and each reverse primer above contains a 
30 BaraHX restriction site. After amplification using the 

mouse and human obR cDNAs as templates, the resulting PCR 
fragments were cloned into the XhoI/BamHI sites of the" 
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CD5-IgG vector (Aruffo et al., 1990, Cell). The 
resulting vectors were transiently transfected into COS 
cells and conditioned media was generated. 
Immunoprecipitation (IP) of the conditioned media with 
5 protein A and analysis by SDS PAGE revealed that the 

mouse ObR IgG fusion was expressed at greater levels than 
human ObR- IgG. To improve expression of the human ObR- 
IgG fusion, primers were designed which amplified the 
extracellular domain of human ObR (without the signal 
10 peptide) and this fragment was coligated with sequences 
encoding the signal-peptide of mouse ObR into the CD5-IgG 
vector. The following primers used for amplification of 
the human ObR ECD fragment that was fused with mouse ObR 
signal peptide: 

15 Forward primer: 

5 f — TTTAA CTTGT C AT ATCCAATTACTCCTTGGAG ATTTAAGTTGT CTTG C 3 ' 
Reverse primer : 5 ' -TTTTTGGATCCCACCTGCATCACTCTGGTG-3 ' 

After amplification, restriction enzyme digestion, and 
subcloning, the resulting construct was transiently 

2 0 expressed in COS cells. IP and SDS-PAGE analysis of the 
resulting conditioned media showed successful expression 
of the 17 0 KD human ObR IgG fusion. An alternative 
method for enhancing the expression of immunoglobulin 
fusion proteins, involves insertion of the ObR 

2 5 extracellular domain (not including the signal peptide) 

into the CD5-IgGl vector in such a manner so that the CDS 
signal peptide is fused to the mature ObR extracellular 
domain. Such a signal peptide fusion has been shown to 
improve expression of immunoglobulin fusion proteins. 



30 C. PREPARATION OF MODIFIED CH2 DOMAINS 

The nucleotide sequence of the obR-lgG gene fusion 
generated supra . can be modified to replace cysteine 



WO 97/19952 



PCT/US96/19128 



- 134 - 

residues in the hinge region with serine residues and/or 
amino acids within the CH2 domain which are believed to 
be required for IgG binding to Fc receptors and 
complement activation. 
5 Modification of the CH2 domain to replace amino 

acids thought to be involved in binding to Fc receptor is 
accomplished as follows. The plasmid construct generated 
supra „ provides the template for modifications of the 
ObR-lgCyl CH2 domain. This template is PCR amplified 

10 using the forward PCR primer described in the first PCR 
reaction supra and a reverse primer designed such that it 
is homologous to the 5' terminal portion of the CH2 
domain of IgGl except for five nucleotide substitutions 
designed to change amino acids 234, 235, and 237 

15 (Canfield, S.M. and Morrison, S.L. (1991) J. Exp . Med . 
173:1483-1491) from Leu to Ala, Leu to Glu, and Gly to 
Ala, respectively. Amplification with these PCR primers 
yields a DNA fragment consisting of a modified portion of 
the CH2 domain. In a second PCR reaction, the template 

2 0 is PCR amplified with the reverse primer used in the 

second PCR reaction supra , and a forward primer which is 
designed such that it is complementary to the Ig portion 
of the molecule and contains the five complementary 
nucleotide changes necessary for the CH2 amino acid 
25 replacements. PCR amplification with these primers yield 
a fragment consisting of the modified portion of the CH2 
domain, an intron, the CH3 domain, and 3' additional 
sequences. The complete obR-IgCyl segment consisting of 
a modified CH2 domain is prepared by an additional PCR 

3 0 reaction. The purified products of the two PCR reactions 

above are mixed, denatured (9 5°C, 1 minute) and then 
renatured (54°C, 30 seconds) to allow complementary ends 
of the two fragments to anneal. The strands are filled 
in using dNTP and Taq polymerase and the entire fragment 
3 5 amplified using forward PCR primer of the first PCR 
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reaction and the reverse PCR primer of the second PCR 
reaction. For convenience of cloning into the expression 
vector, the resulting fragment is then cleaved with 
restriction enzymes recognizing sites specific to the 
5 forward PCR primer of the first PCR reaction and the 
reverse PCR primer of the second PCR reaction. This 
digested fragment is then cloned into an expression 
vector that has also been treated with these restriction 
enzymes. 

10 Sequence analysis is used to confirm structure and 

the construct is used to transfect COS cells to test 
transient expression. hlgG ELISA is used to 
measure /con firm transient expression levels approximately 
equal to lOOng protein/ml cell supernatant for the 

15 construct. CHO cell lines are transfected for permanent 
expression of the fusion proteins. 

D. . OBR-Iq NEUTRALIZES Ob PROTEIN 

To establish whether the ObR-IgG fusion proteins 
were capable of binding and neutralizing OB protein 

20 (leptin) In vitro and in mice, large scale transient 

transf ections were performed in 293 cells using the mouse 
ObR-IgG fusion protein. The ObR-IgG protein was purified 
to near homogeneity on a protein A column and analyzed 
for its ability to inhibit the binding of an alkaline 

25 phosphatase-OB fusion protein (AP-OB) to cell surface 
ObR. 

COS cells were transiently transfected with mouse 
obR cDNA and tested for their ability to bind 0.5 nM AP- 
OB. As demonstrated in Fig. 10 purified ObR-IgG was able 
30 to potently inhibit, or neutralize, the binding of AP-OB 
fusion protein to cell surface ObR. 

Fig. 10, column 1 shows the high levels of specific 
binding observed in the absence of ObR-IgG fusion 
protein. Columns 2, 3 and 4 show the near complete 
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inhibition of binding observed with three different 
column fractions of purified ObR-IgG. 

XI . - THE OBR LONG -FORM HAS SIGNALLING CAPABILITIES 
OF IL-6 TYPE CYTOKINE RECEPTORS 

5 To address whether the cloned ObR isoforms are 

signaling competent, the ObR gene was introduced into 

established tissue culture cell lines and the cell 

response to OB treatment was compared with that mediated 

by the structurally-related IL-6 type cytokine receptors. 

10 The results presented in this example provide evidence 

that the ObR long form is a signal-transducing molecule 

and shares functional specificity with IL-6-type cytokine 

receptors - 

A, . MATERIALS AND METHODS 
1. CELLS 

COS-1, COS-7, H-35 (Baumann, et al., 1989, Ann. N.Y. 
Acad. Sci. 557:280-297), HepG2 , and Hep3B (Lai, et al. , 
1995, J. Biol. Chem. 270:23254-23257) cells were cultured 
as described. The cells were treated in medium 
containing 0.5% fetal calf serum alone or supplemented 
with 1 /iM dexamethasone, 0.1-1000 ng/ml human OB, 1000 
ng/ml mouse OB, IL-6 (Genetics Institute) or G-CSF 
(Immunex Corp.). To inhibit signaling by gpl30, the 
cells were treated with the combination of two pan- 
blocking monoclonal antibodies against human gpl30, B-R3 
(Chevalier, et al., 1995, N.Y. Acad. Sci. 762:482-484) 
and 144 (20 Mg/ml) . 

2 . . EXPRESSION VECTORS AND CAT REPORTER GENE CONSTRUCTS 

Expression vectors for the long form of human ObR 
30 and the short form of mouse ObR are described above 

(Sections 7-9). The truncated human G-CSFR(27) (Ziegler, 
et al., 1993, Mol. Cell. Biol., 13:2384-2390) and rat 
STAT1, STAT 3 and STAT5B (Lai, et al . , 1995, J. Biol. 



20 
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Chem., 270:23254-23257; Ripperger, et al., 1995 , J. Biol. 
Chem., 270:29998-30006) have been described. ObR with a 
mutated box 3 sequence (Y1141F) was generated by overlap 
extension PCR using synthetic oligonucleotides encoding 
5 the specified amino acid substitution (Higuchi, et al. , 

1988, Nucleic Acids Res., 12:5707-5717). The yl!4lF 
contains a replacement of the tyrosine at position 1141 
with phenyalamino. Plasmid SV-SPORTl (Life Technologies, 
Inc.) containing rat STAT 3 truncated by 55 carboxy- 

10 terminal residues has been generated by converting codons 
716 and 717 to two stop codons. The CAT reporter gene 
constructs, pHRRE— CAT and plL-6RE-CAT, have been 
described previously (Lia, et al. , 1995, J. Biol. Chem., 
270:23254-23257; Morella, et al . , 1995, J. Biol. Chem., 

15 270:8298-8310) . 

3 . . CELL TRANS F ECT I ON AND ANALYSIS 

COS-1, H-35 and Hep3B cells were transfected with 
plasmid DNA by the DEAE-dextran method (Lopata, et al., 

1989, Nucleic Acids Res., 12:5707-5717), HepG2 cells by 
20 the calcium phosphate method (Graham, et al. , 1973, 

Virology, 52:456-461), and COS-7 cells by the 
lipof ectaraine method. Subcultures of COS cells were 
maintained for 16 hours in serum-free medium prior to the 
activation of STAT proteins by treatment with cytokines 

2 5 for 15 min. DNA binding by STAT proteins were determined 
by EMSA on whole cell extracts as described in Sadowski, 
et al. (1993, Science, 26:1739-1744). Double stranded 
oligonucleotides for the high affinity SIEm67 (Sadowski, 
et al., 1993, Science, 26:1739-1744) and TB-2 (Ripperger, 

30 et al., 1995, J . Biol. Chem, 270:29998-30006) served as 
EMSA substrates. CAT gene-transf ected cell cultures were 
treated for 2 4 hours with cytokines or OB. CAT 
activities were quantitated by testing serial dilutions 
of cell extracts, normalized to the expression of the 
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cotransfected marker plasmid pIE-MUP (Morella, et al., 
1995, J. Biol. Chem., 270:8298-8310), and are expressed 
relative to the value of the untreated control cultures 
in each experimental series (defined as = 1.0). 
5 Quantitative cell surface binding of the AP-OB fusion 
protein (Section 6) was done essentially as outlined by 
Cheng and Flanagan (Cheng & Flanagan, 1994, Cell, 79:157- 
168) . 

B. . RESULTS AND DISCUSSION 
10 1.. OBR ACTIVATES STAT PROTEINS 

To determine whether ObR has the ability to recruit 
the cellular signaling machinery, COS cells were 
transiently transfected with expression vectors for the 
two representative forms of ObR, mouse short form (also 

15 corresponding to a mutated form detected in db/db mice) 
and human long form. Two days after transf ection , cells 
were incubated in InM human or mouse alkaline 
phosphatase-OB cell surface expression of ObR was 
detected as indicated by specific binding of the alkaline 

20 phosphatase-OB (AP-OB) fusion protein. Transf ection of 
the short form ObR resulted in approximately 10-fold 
higher binding than the long form. Scatchard 
transformation of binding data performed at multiple AP- 
OB concentrations indicated that the lower binding 

25 observed for the long form was mainly a result of reduced 
cell surface expression. The mouse short form bound both 
the murine and human ligands with an affinity of 0.7 nM, 
and the human long form bound both the murine and human 
ligands with an affinity of 1.0 nM. 

30 COS-1 cells were co-transf ected with expression 

vectors for human or mouse ObR (2 /ig/ml) and the various 
STAT protein (3 ng/ml) . Co-transf ection of the 
expression vectors for ObR and various STAT isoforms 
allowed analysis of the ligand-induced activation of 
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specific STAT proteins. The transfected cells were 
treated for 15 min. without or with murine OB (ioo ng/ml) 
and activation of DNA binding of the STAT proteins was 
identified by EMSA using the diagnostic oligonucleotide 
5 substrates STE or TB-2 . In these experiments, only the 
long form of ObR activated either endogenous COS STAT 
proteins, or the co-expressed STAT1 , STAT 3 , or STAT5B. 
Activation of all STAT isoforms by ObR was ligand 
dependent. In contrast, the short form of ObR was unable 

10 to activate any endogenous or co-transf ected STAT 

proteins despite its high surface expression. Since the 
long form of ObR activated all the STAT proteins that are 
also activated by G-CSFR, LIFR and gpl3 0 (Kishimoto, et 
al., 1995, Blood, 86:124 3-1254; Lia, et al., 1995, J. 

15 Biol. Chera. 270:23254-23257), the long form ObR was 

predicted to stimulate transcription with a specificity 
of the IL-6-type cytokine receptors. 



2 . . OBR SIGNALS INDUCE GENE EXPRESSION 

Rodent and human hepatoma cell lines have previously 
20 been utilized to define the gene-inducing action of 

ectopically-expressed hematopoietin receptors (Baumann, 
et al., Mol. Cell. Biol., 14:138-146). Consequently, 
three complementary hepatoma cell lines were applied to 
characterize ObR signaling. The long or short forms of 

2 5 ObR or human G-CSFR, were introduced into rat H-35 cells, 

together with the HRRE— CAT reporter gene construct, the 
expression of which is increased in these cells by 
signals of many hematopoietin receptors (Morella, et al., 
1995, J. Biol. Chem. , 270:8298-8310). Subcultures were 

3 0 treated for 2 4 hours with serum-free medium alone or 

containing cytokines (mOB, LIF or IL-6) with or without 
dexamethasone . The long form of ObR mediated ligand- 
dependent induction of CAT gene expression. The 
stimulatory action was synergistically enhanced by 
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dexamethasone. The cell response mediated by ObR was 
highly similar to that of the endogenous IL-6R but 
characteristically different from the endogenous LIFR. 
In contrast, the short form of ObR failed to induce gene 
5 expression, indicating that the 34 residue cytoplasmic 
domain, despite the presence of a box 1-related motif, 
was ineffective in recruitment of the cellular signaling 
components. The fact that the G-CSFR with a cytoplasmic 
domain truncated to 27 residues still induced gene 

10 transcription in the presence of ligand illustrated that 
the cells were able to respond to the signal derived from 
a short, box-l -containing cytoplasmic domain of a 
hematopoietin receptor. The lack of induction of CAT 
gene expression in G-CSFR-transf ected control cells 

15 demonstrates that H-3 5 cells do not respond to OB in the 
absence of transf ected ObR. 



3- . OBR FUNCTIONS INDEPENDENTLY OF OP13 0 

The results described above support the model that 
the long form of ObR reconstitutes a signaling pathway 
2 0 similar to that of IL-6R. Next, to determine whether 

gpl30 is part of the functional ObR, the long form of ObR 
was introduced together with HRRE-CAT or IL-6RE-CAT into 
HepG2 cells and the inhibitory effects of anti gpl30 
antibodies was assessed. 

2 5 Treatment of the transf ected HepG2 cells with either 

mouse or human OB produced a similarly strong induction 
which was in the range of that produced by IL-6 (3 0-4 0 
fold stimulation) . A dose response analysis indicated 
that maximal regulation was achieved with 100 ng/ml OB. 

3 0 In four independent experiments, it was established that 

1-5 ng/ml OB produced a half -maximal stimulation, and 
that 1000 ng/ml yielded a stimulation that was 
consistently below maximum. In the presence of 
monoclonal antibodies against human gpl30, which are 
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known to prevent signaling by all IL-6 type cytokine 
receptors (Chevalier, et al., 1995, N.Y. Acad. Sci. 
762:482-484), the stimulation of gene expression by IL-6 
was abolished as expected, whereas the regulation by OB 
5 was unaffected. These results indicate that ObR 

functions independently of gpl3 0 (insensitive to anti- 
gpl30) and that signal initiation may be triggered by 
receptor homo-oligomerization. 

4 - . BOX 3 SEQUENCE OF OBR AND ST AT 3 ARE 
10 INVOLVED IN SIGNALING 

Induction of transcription via IL-6 RE is 

characteristic of the hematopoietin receptors of IL-10R 

which contain at least one copy of the box 3 motif (YXXQ) 

in their cytoplasmic domains (Lai, et al., 1995, J. Biol, 

15 Chem. 270:23 254-23257). This box 3 sequence has been 

implicated in recruiting STAT 3 to the receptor as part of 
its activation by receptor-associated kinases (Lia, et 
al., 1995, J. Biol. Chem. 270:23254-23257); Stahl , et 
al-, 1995, Science 267:1349-1353). The long form of ObR 

20 (FIG. 3) contains at amino acid position 1141 to 1144 one 
copy of the box 3 motif that could account for the 
activation of STAT 3 and transcriptional stimulation of 
IL-6RE-CAT. To assess whether the box 3 motif of ObR and 
STAT 3 were involved in the gene inducing effect of ObR, 
5 two complementary reagents were applied: a box 3 -mutant 
ObR and a dominant negative STAT 3 . The role of box 3- 
sequence in the long form of ObR was determined by 
mutating tyrosine at amino acid position 1141 to 
phenylalanine (Y1141F) . Hep G2 and H-35 cells were 
0 transfected with an expression vector for wild-type ObR 
or ObRY1141F (2 Atg/ml) together with either pHRRE-CAT or 
pIL-6RE— CAT . Cells were treated with human OB (100 
ng/ml) , and the relative change in CAT activity was 
determined. The mutant ObR transfected into HepG2 cells 
5 yielded a lower stimulation of both the HRRE-and IL-6RE- 
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CAT reporter gene constructs than the wild-type ObR. For 
example, stimulation of HRRE— CAT expression was reduced 
40 fold in HepG2 cells and H-35 cells. Stimulation of 
I1-6RE-CAT was reduced 20-fold in HepG2 cells and 100- 
5 fold in H-3 5 cells. Control experiments indicated that 
reduced signaling activity of the mutant ObR was not due 
to compromised surface expression as shown by AP-OB 
binding. The relative effect of the mutation was more 
prominent on IL-6RE than on HRRE. A similar experiment 
10 carried out in H-3 5 cells showed that box 3 mutation was 
correlated with a loss of IL-6RE regulation, whereas HRRE 
regulation was minimally affected. The results are 
consistent with previous observations that, in some cell 
lines, the recruitment of STAT 3 was more important in 
15 gene induction through IL-6RE then through HRRE (Lai, et 
al., 1995, J. Biol. Chem. 270:23254-23257; Morella, et 
al., 1995, J. Biol. Chem. 270:8298-8310; Wang, et al., 
1995, Blood 86:1671-1679). 

The reduced gene-regulatory effect of the Y1141F ObR 
20 mutant was also correlated with a lower activation of 

STAT proteins. When the mutant ObR was transfected into 
COS-1 cells, as done for the wild-type ObR, activation of 
the endogenous COS STAT proteins was not detected. Also, 
ObR Y1141F was approximately 10 times less effective in 
2 5 activating overexpressed STAT1 and STAT 3 than wild type 
ObR. Activation of STAT5B was, however, unaffected by 
the mutation. This profile of STAT activation by ObR 
Y1141F was in agreement with that observed for box 3- 
deficient gpl30 (Lai, et al . , 1995, J. Biol. Chem. 
30 270:23254-23257) and G-CSFR (Morella, et al., 1995, J. 

Biol. Chem. 270:8298-8310) and would explain the specific 
changes in the regulation of the reporter gene 
constructs . 

The signal transducing role of STAT 3 was determined 
35 by using over-expression of STAT3**55C, a mutant STAT 3 
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with a 55 residue car boxy terminal truncation that acts 
as dominant negative inhibitor of STAT 3 action on gene 
transcription. DNA binding assays such as those 
described in Section 11.2.1. # supra . verified that the 
5 long form of ObR efficiently activated DNA binding 

activity of STAT3^55C. STAT3-*55C essentially abolished 
the ObR mediated induction of IL-6RE and reduced that of 
HRRE by 50%. These data indicate that in the hepatic 
cells, ObR engages signal transduction pathways that are 
10 also utilized by the IL-6-type cytokine receptors and ar< 
sensitive to STAT3**55C. 



5.. OBR CAN UTILIZE BOTH STAT3 AND STAT 5 B GENE 

INDUCTION 

Induction of the selected reporter gene constructs 
15 in HepG2 or H-35 cells is maximal and not significantly 
enhanced by over -expressed wild-type STAT proteins. To 
assess whether the STAT proteins activated by ObR play a 
positive mediator role, human Hep3B cells were 
transfected with human ObR together with either pIL-6RE- 

2 0 CAT or pHRRE-CAT , and the expression vector for the STAT 

proteins. Stimulation of CAT activity by human OB (100 
ng/ml) relative to untreated control was determined (mean 
± S.D.; N=3 to 4). Those hepatoma cells have retained 
expression of functional IL-6R, but lack the receptors to 
25 other IL-6-type cytokines (Baumann, et al., 1994, Mol. 
Cell. Biol. 14:138-146). Moreover, these cells have a 
relatively low level of STAT 3 and -5, thus permitting 
testing of the signaling of ObR by gain of function 
through over-expression of STAT proteins. The results 

3 0 from these experiments indicate that overexpressed STAT 3 

mediated induction of IL-6RE 15-fold. Overexpressed 
STAT 3 1 and STATS B enhanced ObR mediated induction of 
HRRE- CAT 5-fold and 3 0-fold, respectively. 
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C. . CONCLUSION 

The results presented above documents that full 
length ObR is a signal transducing receptor with a mode 
of action related to the IL-6-type cytokine receptors. 
5 The data also support the hypothesis that the truncated 
ObR variants, such as the short form expressed in many 
tissues or encoded by the c?jb mutant transcript, are 
either signaling-incompetent or exert a reduced signaling 
repertoire that is not detectable by the tools applied 
10 here. The fact that reconstitution of an OB response is 
achieved at the level of gene expression in hepatic cells 
strongly suggests that an equivalent process may occur in 
hypothalamic cells or other cell types that normally 
express the full-length ObR. The link of ObR to specific 
15 signaling pathways utilizing STAT proteins and the 

knowledge of the specificity of these proteins to control 
genes through identifiable DNA binding elements may 
assist in identifying the immediate ObR effects that are 
relevant to understanding OB action in vivo. The 
experimental system presented above can also be used to 
address questions about the functional role, if any, of 
the naturally occurring short forms of ObR in functional 
regulation of the long form. 



20 



25 



12. MUTATIONAL ANALYSIS OF OBR 

In order to identify regions of the ObR cytoplasmic 
region important for activation of genes, a number of ObR 
mutants were created and analyzed. These studies, 
described below, identified two distinct regions of the 
ObR cytoplasmic domain important for induction of gene 
3 0 expression. 

12 * MATERIALS AND METHODS 



12.1.1 CELLS 
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COS-1, COS-7 and H-35 cells were cultured as 
described by Baumann et ah, 1989, Ann. N.Y. Acad. Scl. 
557:28 0-2 97. Cells were mock stimulated in medium 
containing 0.5% fetal calf serum and 1 M M dexamethasone 
or treated in the same medium supplemented with 100 ng/ml 
human leptin (Roche) , IL-6 (Genetics Institute) , or G-CSF 
(Immunex Corp.) . 



12*1.2 EXPRESSION VECTORS AND CAT 
REPORTER GENE CONSTRUCTS 



10 



The expression vectors for the long form of 
human ObR are described above (Section 9) and rat STAT1, 
STAT 3 and STATS B have been described previously (Lai et' 
al., 1995, J. Biol. Cham. 270:23254-23257; Ripperger et 
al., 1995, J. Biol. Chem. 270:29998-30006). POB-Ra1115- 
15 1165, pOB-RA 1065-1165 and P OB-Ra965-1165 , all encoding 
carboxy-terminal truncated human ObRs, were generated by 
PCR. Briefly, oligonucleotides spanning the 
intracellular domain of human ObR were used to generate 
in-frame stop codons 3' to the specified amino acids. 
2 0 The PCR fragments were digested with EcoRV and Xbal and 
subcloned into human ObR that had been digested with 
EcoRV and Xbal. A similar strategy was used to generate 
POB-Ra868 but with primers generating an Mscl-Xbal 
fragment that replaced endogenous human ObR sequences. 
2 5 P0B-RY1141F, encoding human ObR with a mutated box 3 
sequence was prepared as described in Section 11.1.2. 
ObR mutants pOB-R(boxlmt) , containing PNP to SNS changes 
in the ObR box 1 motif (aa 876 and 878) , and mutants pOB- 
RY986F and pOB-RY1079F, were generated by overlap 
30 extension PCR using synthetic oligonucleotides encoding 
the specified Tyr to Phe amino acid substitutions 
(Higuchi et al., 1988, Nucleic Acids Res. 16:7351-7367). 
The CAT reporter gene constructs, pHRRE— CAT and pIL-6-CAT 
have been described previously (Lai et al. , 1995, J. 
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Biol. Chem. 270:23254-23257; Morella et al , 1995, J*. 
Biol. Chem. 270:8298-8310). 

12-1-3 CELL TRANS FECT ION AND AK^ T.Vb t a 

COS-1 and H-3 5 cells were transfected by the 
DEAE-dextran method (Lopata e t al., 1984, Nucleic Acids 
Res. 12:5707-5717) , and COS-7 cells by the lipof ectamine 
method (Tartaglia et al. , 1995, Cell 83:1263-1271). For 
analysis of STAT protein activation, COS cells were 
maintained for 16 h in serum-free medium, followed by 
treatment of cells with 100 ng/ml leptin or G-CSF for 15 
min. 

For CAT assays, transfected cell cultures were 
subdivided and treated with ligands for 24 hours. CAT 
reporter activities were determined and are expressed 

15 relative to values obtained for untreated control 

cultures for each experimental series. DNA binding by 
STAT proteins was analyzed by electromobility shift assay 
( EMS A) using whole cell extracts as described by Sadowski 
et al. (1993, Science 26:1739-1744). Radiolabeled double 

20 stranded oligonucleotides SIEm67 (for STAT1 and STAT 3 ) 

and TB-2 (for STAT5B) served as binding substrates in the 
EMSA. Receptor expression in COS cells was analyzed by 
quantitative cell surface binding of AP-OB fusion protein 
as described by Cheng and Flanagan (1994, Cell 79:157- 

25 168). 

12.1.4 IMMUNQBIiQTTlNG 

All immunoblotting was done as described by 
Baumann et al. (1996, Proc. Natl . Acad. Sci . U.S.A. 
93:xxx-xxx) and immunoreactive proteins were visualized 
3 0 by enhanced chemi luminescence detection as described by 
the manufacturer (Amersham) . Rabbit polyclonal antiserum 
specific for STAT 5 B was from Santa Cruz Biotechnology. 
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12.2 RESULTS AND DI8CUSSIQW 

As discussed above, ObR is a member of the class I 
cytokine receptor superfamily. Receptors of this class 
lack intrinsic tyrosine kinase activity and are activated 
5 by ligand- induced receptor homo-dimerization or hetero- 
dimerization. In many cases, activation requires 
activation of receptor-associated kinases of the Janus 
family (JAKs) (Ihle et al. , 1994, Trends. Biol. Sai . 
19:222-227). JAKs associate with the membrane-proximal 

10 domain of the intracellular part of the cytokine 

receptors, and serve to initiate signal transduction 
pathways following ligand induced receptor activation. 
Included among the downstream targets of the JAK proteins 
are members of the STAT (Signal Transducers and 

15 Activators of Transcription) family of transcription 
factors (Ihle et al., 1994, Trends. Biol. Sci . 19:222- 
227). The STATs are DNA binding transcription factors 
that contain Src-homology (SH2) domains that interact 
with receptor molecules through phosphorylated tyrosine 

2 0 residues. STAT proteins are activated by tyrosine 

phosphorylation, form heterodimers or homodimers, 
translocate to the nucleus and modulate transcription of 
target genes. 

12.2.1 THE OBR INTRACELLULAR DOMAIN INCLUDES AT LEAST TWO 

25 REGIONS IMPORTANT FOR SIGNALLING 

To define regions of the ObR cytoplasmic domain 
required for signaling, a series of C-terminal deletion 
mutants were constructed (Fig. HA) . These cDNAs 
encoding these mutants were transiently co-transf ected 

3 0 into H-3 5 cells with either I L- 6 RE— CAT or HRRE— CAT 

reporter constructs and assayed for their ability to 
stimulate transcription (Fig. 11B) . C-terminal 
truncations that remove box 3 sequences (aa 1141-1144) of 
ObR abolish transcriptional activation via IL-6-RE (Fig. 
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11B; upper panel) . This result is consistent with the * 
fact that a Y to F mutation in the single box 3 motif or 
ObR completely disrupts signaling in H-35 cells via IL- 
6RE (Section 11.2,4). In contrast, ObR signaling through 
5 HRRE was minimally reduced by removal of extreme c- 

terminal sequences and was not completely disrupted until 
removal of the approximately 97 amino acids between 868 
and 965 (Fig. 11B) . 

To insure that the expression vectors for the 
10 various ObR mutants directed the synthesis of surface 
localized receptor proteins, COS cells transfected with 
each construct were assayed for receptor expression by 
AP-OB binding studies. C-terminal truncations of ObR 
generate proteins that are expressed at the surface and 
15 bind ligand (Fig. 12) . Moreover, the expression level of 
ObR increased with progressive truncation of the 
intracellular domain. 

As discussed above, ObR gene induction via IL-6RE 
correlates with activation of STATl and STAT 3 whereas ObR 
20 gene induction via HRRE was found to correlate with 
activation of STATS B • To further evaluate the 
correlation between HRRE stimulation and STAT 5 B 
activation, COS cells were co-transf ected with expression 
plasmids for STATS B and the ObR deletion mutants. 
2 5 Immunoblotting performed on extracts prepared from these 
cells revealed that STAT5B was expressed at relatively 
equal amounts in each of the transfected cultures. Cells 
were treated with leptin. EMSA analysis was performed, 
and STAT protein levels were quantitated by Western 
30 blotting. Progressive C-terminal truncations of ObR 
result in a reduced ability to activate STAT SB and 
detectable STATS B activation was lost only with removal 
of the membrane proximal ObR segment (construct pOBR&868- 
1165) . Thus, there appears to be a correlation between 
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loss of ObR STAT5B activation and gene induction via 
HRRE. 

To define the relative contribution of the conserved 
intracellular domain tyrosine residues and of the 
5 membrane proximal box 1 motif to signaling by ObR via 
HRRE, mutants OB-RY1141F, OB-RY986F, OB-RY1079F and OB- 
R(box lint) were generated (Fig. 13A) . When analyzed in 
COS cells, AP-OB binding studies demonstrate that these 
mutants are expressed at the cell surface approximately 

10 as well as wild-type ObR. When transfected into H-35 
cells, OB-RY986F and OB-RY1079F were unchanged in their 
ability to regulate HRRE (Fig. 13B) . In contrast, 
mutation of the ObR box i motif results in a complete 
loss of regulation of gene induction through this 

15 element. Thus, the box l motif of ObR appears to be an 
important determining factor for the ability of ObR to 
activate pathways that can modulate gene induction via 
HRRE. 

Gene induction by ObR through IL-6RE requires 
2 0 sequences near the extreme C-terminus of ObR (Fig. hb) . 
In contrast, ObR gene induction through HRRE does not 
appear to require these C-terminal sequences. Moreover, 
gene induction via this element is only minimally 
effected by removal of ObR intracellular domain sequences 
of approximately 200 amino acids between amino acids 965- 
1165 but is dependent upon membrane proximal sequences of 
the approximately 17 amino acids between amino acids 868 
and 965. Consequently, the proposed box 2 motif of ObR 
(Lee et al., 1996, Nature 379:632-635) (human ObR aa 
0 1066-1075) does not appear to contribute to gene 

induction through HRRE. EMSA analysis suggests gene 
induction of HRRE correlates with the ability of ObR to 
activate STATS B . Interestingly, 0B-Ra965-1165, which has 
been deleted of all intracellular domain tyrosine 
5 residues and therefore all potential SH2 docking sites, 



25 
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x. still capable of low-level STAT5B activation and 
transcriptional stimulation through HRRE . only n 

;^ ne Pr ° Ximal of obR are ^ (^Laea- 

1165 , are both HRRE gene induction and STAT5B activttxl 

:::r \ y aboiished - consistent ^ «-» *ox 

imt, contaxn^g a mutated box i motif, is nimLl ' 
unable to induce gene induction through HRRE and would be 
predicted to be unable to activate STAT5B . 

13 * MPXTIMERIZATIOM nv nn H 

10 The primary structure of obR suggests that it is 

closely related to the signaling subunits of the l L - 6 
type cytokine receptors. Members of this group can be 
activated by either heterodimerization or 
homodimerization (Kishimoto et al., 1994, cell 76-253 

among the former are the receptors for l L - 6 , leu]cemia 
inhibitory factor (LIF) , oncostatin M, il-ii, and ciliary 

s n rgnrrt°ran C d faCt0r ^ ' ^ °* ^ ~- - 

signal transducer, gpi 3 0 (Kishimoto et al. , i 994/ cell 

20 76:253- 262 ; Taga et al . , ^ ^ 58:573 . 581) ' 

However, previously we have found that ObR appears to 
signal independently of g P i 30 (Baumann et al i 996 

me *.„. Acad . sci . UmSaAm 93:xxx _ xxx Th ; refor ; 

may function in the presence of another accessory chain 
such as the common signaling subunit utilized by 
receptors for either l L -3 , granulocyte macrophage-colony 
stimulating factor (GM-CSF) and IL-5 (IL-3R0) , or 1L _ 2 
IL-4 IL-7 and IL- 9 (IL-2Ry>. However, ObR signals in' 
hepatoma cells, which do not express either li^R/j or i L - 
30 Ry (Wang et al., 1995 , SIood 86 : 1671- 1679 ; Morella et 

al i 995 , j . Biol. chem. 270 : 82 9 8-83 10 > . Alternatively 
ObR may be activated by homodimerization as is found for 
the granulocyte-colony stimulating factor receptor (g- 
CSFR) (Fukanaga et al., l 99 l, BMBO J . 10:2855-2865- 
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Ishezaka-Ikeda et al., 1993, Prac . Natl. Acad. Scl , 
U.S.A. 90:123-127). Therefore, to determine whether ObR 
has the ability to dimerize and signal as a homodimer, 
chimeric receptors encoding the extracellular domain of 
5 G-CSFR joined to the intracellular domain of ObR or the 
reciprocal receptor having the extracellular domain of 
ObR joined to the intracellular domain of G-CSFR were 
constructed and analyzed (Fig. 14A) . 

13.1 MATERIALS AND METHODS 

10 13.1.1 CELLS 

COS-1, COS-7 and H-35 cells were cultured as 
described by Bauroann et al., 1989, Ann. N.Y. Acad. Sci . 
557:280-297. Cells were mock stimulated in medium 
containing 0.5% fetal calf serum and l /iM dexamethasone 
15 or treated in the same medium supplemented with 100 ng/ml 
human leptin (Roche) , IL-6 (Genetics Institute) , or G-CSF 
( Immunex Corp „ ) . 

13.1.2 EXPRESSION VECTORS AND CAT 
REPORTER GENE CONSTRUCTS 

20 The expression vectors for the long form of 

human ObR are described above (Section 9) , full-length G- 
CSFR or truncated G-CSFR (Acyto) (Ziegler et al., 1993, 
Mol. Cell. Biol. 13:2384-2390), and rat STAT1 , STAT 3 and 
STATS B have been described previously (Lai et al., 1995, 

25 J. Biol. Chom. 270:23254-23257; Ripperger et al. , 1995, 

J. Biol. Chem. 270:29998-30006). As used herein the term 
"Acyto" means deletion of the cytoplasmic domain. The G- 
CSFR/ObR chimeric receptor was generated by PCR and 
encodes the extracellular domain of human G-CSFR (aa 1- 

30 598) joined near the transmembrane and intracellular 
domain of human ObR (aa 829-1165). The ObR/G-CSFR 
chimeric receptor was generated by PCR and encodes the 
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mouse ObR extracellular domain and transmembrane 
sequences (aa 1-860) joined to the intracellular domain 
of the human G-CSFR (aa 631-813). The CAT reporter gene 
constructs, pHRRE-CAT and plL-6-CAT have been described 
5 previously (Lai et al., 1995, J. Biol. Chem. 270:23254- 
23257; Morella et al, 1995, J . Biol. Chem. 270:8298- 
8310) . 

13.1-3 CELL TRANS FECT ION AND ANALYSTS 

COS-1 and H-35 cells were transfected by the 
10 DEAE-dextran method (Lopata et al., 1984, Nucleic Acids 
Res. 12:5707-5717), and COS-7 cells by the lipof ectamine 
method (Tartaglia et al., 1984, Cell 83:1263-1271). For 
analysis of STAT protein activation, COS cells were 
maintained for 16 hours in serum-free medium, followed by 
15 treatment of cells with 100 ng/ml leptin or G-CSF for 15 
minutes . 

For CAT assays, transfected cell cultures were 
subdivided and treated with ligands for 24 hours. CAT 
reporter activities were determined and are expressed 

2 0 relative to values obtained for untreated control 

cultures for each experimental series. DNA binding by 
STAT proteins was analyzed by electromobility shift assay 
(EMSA) using whole cell extracts as described by Sadowski 
et al. (1993, Science 26:1739-1744). Radiolabeled double 

2 5 stranded oligonucleotides SIEm67 (for STAT1 and STAT 3 ) 

and TB-2 (for STATS B) served as binding substrates in the 
EMSA. Receptor expression in COS cells was analyzed by 
quantitative cell surface binding of AP-OB fusion protein 
as described by Cheng and Flanagan (1994, Cell 79:157- 

30 168) . 



13.1.4 IMMUNOBLOTTING 

All immunoblotting was done as described by 
Baumann et al. (1996, Proc « Natl. Acad. Sci . U.S.A. 
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93:xxx-xxx) and immunoreactive proteins were visualized 
by enhanced chemiluminescence detection as described by 
the manufacturer (Amersham) . Rabbit polyclonal antiserum 
specific for STATS B was from Santa Cruz Biotechnology. 
5 Goat polyclonal antiserum against bacterially expressed 
extracellular domain of G-CSF-R was prepared at Roswell 
Park Cancer Institute Springville Laboratories. 

13.2 RESULTS AND DISCUSSION 

The experiments described below suggest that, while 
10 dimerization of the ObR cytoplasmic domain may be 

sufficient for signal transduction, higher order homo- 
oligomers can abe formed in response to ligand binding. 

13.2.1 HOMODIMERIZATION OBR INTRACELLULAR 
DOMAINS MAY BE SUFFICIENT FOR SIGNAL TRANSDUCTION 

15 Since chimeric receptor complexes have proven quite 

productive for the analysis of the mechanism of cytokine 
receptor activation (Morella et al. f 1995, J*. Biol. Chem. 
270:8298-8310; Vigon et al., 1993, Oncogene 8:2607-2615; 
Baumann et al., 1994 , Mol . Cell. Biol. 14:138-146), 

20 ObR/G-CSFR and G-CSFR/ObR chimeras were produced and 
studied as a means to analyze the mechanism of ObR 
signaling (Fig. 14A) . To analyze whether the G-CSFR/ObR 
chimeric receptor could propagate a ligand induced signal 
comparable to that for wild-type ObR, the chimera was 

25 tested for STAT activation and for transcriptional 

stimulation. Co-transf ection of G-CSFR/ObR with STAT 
proteins yielded a G-CSF-induced activation of STAT1, 
STAT 3 and STAT 5 B . This result is similar to the STAT 
protein activation induced by OB in ObR transfected cells 

3 0 (Section 12) . Expression of the chimeric receptor was 
confirmed by immunoblot analysis of cultures transfected 
with G-CSFR/ObR. These results suggest that G-CSF 
mediated dimerization of ObR cytoplasmic domains can 
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generate an ObR-type activation of STAT proteins. in 
addition, it was found that the G-CSFR/ObR chimera could 
stimulate transcription as detected by measurement of 
gene induction in H-3 5 cells following receptor co- 
5 transfection with the IL-6RE and HRRE reporter constructs 
(Fig- 14B) . The was response elicited was found to be 
similar to an induction of the reporter gene constructs 
by either ObR or endogenous IL-6R. 

These results indicate that homodimerization of two 

10 ObR cytoplasmic domains can initiate signaling by ObR, 

similar to the mechanism mediating signaling by wild-type 
G-CSFR. However, the G-CSFR/ObR chimera could not 
definitively prove that OB ligand has the capability to 
dimerize ObR extracellular domains. Consequently, 

15 signaling activity by the reciprocal chimera containing 
the ObR extracellular domain joined to the G-CSFR 
intracellular domain was analyzed (Fig. 14A) . Indeed, 
the ObR/G-CSFR chimera could mediate gene induction 
comparable to that by wild-type ObR, G-CSFR/ObR and wild- 

20 type G-CSFR (Fig. 14B) . Thus, taken together, these 
results suggest that ObR does not require an accessory 
chain for signaling and that aggregation of two ObR 
intracellular domains appears sufficient for receptor 
activation. 

25 The fact that aggregation of two ObR intracellular 

domains is sufficient to generate a signal following 
ligand- induced activation suggests that ObR may function 
by receptor homodimerization. If so, signaling by ObR 
might be "poisoned" by overexpression of a homodimer izing 

30 partner that is signaling deficient, similar to what has 
been shown for members of the receptor tyrosine kinase 
family (Paulson et al., 1989 , J. Biol, Chem. 264:17615- 
17618; Svensson et al . f 1990, J . Biol. Chem . 265:20863- 
20868; Wen et al., 1992, J . Biol. Chem. 267:2512-2518; 

35 Fantl et al., 1993, Annu . Rev. Bioohem. 62:453-481). As 
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discussed above (Section 12), ObR containing only the 
membrane proximal 6 amino acids of the cytoplasmic domain 
is signaling defective (Fig. 11B) . Consequently, 
experiments were performed to determine whether 
5 expression of a truncated, signaling deficient ObR could 
disrupt signaling by full-length ObR. Cells were co- 
transfected with increasing amounts of truncated receptor 
OB-Ra868-1165 relative to full-length ObR and the ability 
of these complexes to stimulate expression of a reporter 

10 gene construct was assayed. Co-transf ection of 

increasing amounts of truncated ObR does result in 
decreased signaling by wild-type receptor (Fig. ISA) . 
However, even at a large excess of truncated to full- 
length receptor, the signaling repression observed did 

15 not approach the degree of reduction observed for 
repression of G-CSFR signaling by overexpressed and 
signaling-deficient truncated G-CSFR (Acyto) (Fig. 15A and 
Fig. 15C) . The differing sensitivity to dominant 
negative repression observed for ObR and G-CSFR was a 

2 0 property of their extracellular domains as shown by 

dominant negative studies with the receptor chimeras 
(Fig. 15B and Fig. 15C) . 

A potential explanation for this weak dominant 
negative repression of ObR is that interaction of two ObR 
25 molecules may require functional domains residing in the 
intracellular region of the receptor. To address this 
possibility, the dominant negative repression of ObR by a 
mutant receptor rendered signaling defective by a single 
amino acid substitution (Y1141F) in the ObR box 3 motif 

3 0 was examined. As described above, this mutation 

completely abolished the ability of ObR to modulate gene 
induction via IL-6RE in H-35 cells (Section 12), 
Consequently, the ability of OB-R(Y1141F) to inhibit 
wild-type ObR signaling via this enhancer element was 
3 5 investigated. These studies revealed that increasing the 
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ratio of transfected mutant OB-RYH4 IF to wild-type 
receptor did not strongly repress signaling (Fig. i 5E ) . 
Thus, the ObR box 3 mutant and OB-Ra868-H65 behave 
similarly in their ability to trans-repress signaling by 
wild-type ObR. interestingly, low level expression of 
either truncated or box 3 mutant ObR receptor generates a 
slight enhancement of signaling by wild-type ObR. 
Moreover, a similar pattern was also observed for ObR/G- 
CSFR signaling in the presence of increasing amounts of 
truncated OB-Ra868-1165 (Figs. 15A, 15B and 15C) . 

As discussed above (Section 11) , ObR can signal in 
hepatoma cells in the presence of neutralizing antibodies 
to the gpl30 signal transducing component of the IL-6- 
type cytokine receptors. Moreover, these hepatoma do not 
15 express the other characterized cytokine receptor 

accessory chains IL-2Ry or IL-3R/3 (Wang et al. , 1995, 
Blood 86:1671-1679; Morella et al., 1995, J\ Biol. Chem. 
270:8298-8310). Consequently, it is possible that ObR 
may function by a mechanism involving receptor 

2 0 homodimerization. Among members of the class I cytokine 

receptor family, signaling by the G-CSFR is predicted to 
be initiated by ligand-induced receptor homodimerization 
(Fukanaga et al., 1991, EMBO J. 10:2855-2865; Ishezaka- 
Ikeda et al. , 1993, Proc. Natl. Acad. Sci . U.S.A. 90:123- 
25 127). As stated above, chimeric receptor complexes have 
proven quite productive for the analysis of the mechanism 
of cytokine receptor activation (Morella et al., 1995, 
supra; Vigon et al. , 1993, supra; Baumann et al . , 1994, 
supra) , ObR/G-CSFR and G-CSFR/ObR chimeras were produced 

3 0 and studied as a means to analyze the mechanism of ObR 

signaling. These studies revealed that the G-CSFR/obR 
chimera can strongly activate transcription of both the 
I Li -6 RE— CAT and HRRE-CAT reporter constructs (Fig. 14B) . 
Since G-CSFR is thought to form a homodimer when G-CSF is 
3 5 bound, this implies that the aggregation of two 
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intracellular ObR domains is sufficient to initiate 
receptor signaling. In a similar manner, the ObR/G-CSFR 
chimera also mediates transcriptional activation, through 
IL-6RE and HRRE (Fig. 14B) . These results show that 
5 leptin binding can dimerize two ObR extracellular chains 
thus inducing the association of at least two 
intracellular G-CSFR domains and activation of the 
receptor complex. Moreover, these results suggest that 
it may be possible to generate small molecules or 
10 antibodies that act as ObR agonists through simple 
crosslinking of two ObR chains. 

As would be predicted for receptors that are 
activated by simple homodimerization, signaling by full 
length G-CSFR and the G-CSFR/ ObR chimera can be greatly 
15 diminished by co-expression of a signaling deficient 
homodimerizing partner. However, OB-Ra 868-1165 was 
unable to as efficiently repress signaling by full-length 
ObR or the ObR/G-CSFR chimera. It is therefore possible 
that leptin binding to cell surface receptors can result 
20 in higher-order oligomerization (receptor 

number>2/complex) as has been shown for IL-10 receptor 
complexes (Tan et al. f 1995, J . Biol. Chem. 21:12906- 
12911) and for members of the Activin/TGF— /?R family 
(Brand et al., 1993, J . Biol. Chem. 2 68:11500-11503; 
25 Weiser et al., 1993, Mol . Cell. Biol. 13:7239-7247; Wrana 
et al., 1994, Cell 71:1003-1014; Moustakas et al., 1993, 
J. Biol. Chem. 268:22215-22218; Henis et al. f 1994, J. 
Cell Biol. 126:139-154). According to this model, ligand 
binding by full-length ObR or ObR/G-CSFR chimera can lead 
3 0 to aggregation of more than two receptor chains, yet 
juxtaposition of only two intracellular domains is 
sufficient for signal generation. Such complexes would be 
predicted to be highly resistant to dominant negative 
repression. The strong repression of signaling by G- 
35 CSFR(Acyto) in complexes containing the G-CSFR/ObR 
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chimera demonstrates that ObR intracellular domain can be 
efficiently repressed when placed in the context of a 
simple homodiroer structure. Although it is possible that 
OB-Ra 868-1165 localizes to a different region of the 
membrane than wild-type obR, it is not likely that 
mutation of a single tyrosine residue of the ObR 
intracellular domain (Y1114F) would result in altered 
receptor membrane localization. Thus, our observation of 
similar repression effects mediated by either OB-Ra8 68- 
1165 or 0B-RY1141F suggests our results are not due to 
altered membrane localization. Low expression levels of 
either OB-Ra868-1165 and OB-RY1141F generate a small 
enhancement of signaling for full length ObR and the 
ObR/G-CSFR chimera. We speculate that this effect is 
15 attributable to either ligand presentation (Andres et 

al., 1989, J" . Cell Biol. 109:3137-3145; Massaugue et al., 
1992, Cell 69:1067-1070; Lin et al. , 1993, Trends. Cell 
Biol. 3:14-19) or ligand passing as has previously been 
observed for the TNF receptor (Tartaglia et al., 1993, J. 
20 Biol. Chem. 268:18542-18548). 

As noted above, it is possible that the short forms 
of ObR serve a transport or clearance function in the 
body (Tartaglia et al. , 1995, Cell 83:1263-1271). 
However, the possiblity that the long and short forms of 

2 5 ObR can functionally interact raises suggests that the 

short form of ObR could regulate activities of the long 
form. This is supported by the fact that the major 
naturally occurring non-signalling short form of ObR in 
the mouse (containing a 3 4 amino acid intracellular 

3 0 domain) , which also corresponds to the mutant ObR found 

in the db/db mouse, can repress long form receptor 
signaling. 



14. DEPOSIT OF MICROORGANISMS 
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The following microorganism was deposited with the 
American Type Culture Collection (ATCC) , Rockville, 
Maryland, on the dates indicated and were assigned the 
indicated accession number: 



Microorganism 

E. coli strain 
1995 

5312B4F3 

10 E. coli 

h-ObRD 
1995 

Coli 



h-ObR-p87 
15 1995 



Clone 

f amj5312 

fahj5312d 
h-ObR-p87 



ATCC 
Accesa. No. 

69952 



69963 



69972 



Date 
of Deposit: 

November 22, 



December 5, 



December 28, 



The present invention is not to be limited in scope 
by the specific embodiments described herein, which are 
intended as single illustrations of individual aspects of 
the invention, and functionally equivalent methods and 
components are within the scope of the invention. 
Indeed, various modifications of the invention, in 
addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing 
description and accompanying drawings- Such 
modifications are intended to fall within the scope of 
the appended claims- 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Tartaglia, Louis A. 

Tepper, Robert I. 
Culpepper, Janice A. 
White, David W. 

<ii) TITLE OF INVENTION: THE OB RECEPTOR AND METHODS FOR THE 
DIAGNOSIS AND TREATMENT OF BODY WEIGHT DISORDERS 

(iii) NUMBER OF SEQUENCES: 50 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) ZIP: 021001-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

<D) SOFTWARE: WordPerfect (Version 5.1) 

(vi) CURRENT APPLICATION DATA: 

<A) APPLICATION NUMBER: US 08/708,123 

(B) FILING DATE: 03-SEP-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/638,524 

(B) FILING DATE: 26-APR-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/599,455 

(B) FILING DATE: 22-JAN-1996 

(vii) PRIOR APPLICATION DATA: 
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(A) APPLICATION NUMBER: US 08/583,153 

(B) FILING DATE: 28-DEC-1995 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/570,142 
5 (B) FILING DATE: ll-DEC-1995 

<vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/569,485 

(B) FILING DATE: 08-DEC-1995 

(vii) PRIOR APPLICATION DATA: 
10 (A) APPLICATION NUMBER: US 08/566,622 

(B) FILING DATE: 04-DEC-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/562,663 

(B) FILING DATE: 27-NOV-1995 

15 (viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Meikeljohn, Ph.D., Anita L . 

(B) REGISTRATION NUMBER: 35,283 

<C> REFERENCE /DOCKET NUMBER: 07334/019001 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 542-5070 

(B) TELEFAX: (617) 542-8906 
(C) TELEX: 200154 



(2) INFORMATION FOR SEQ ID NOxl: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3097 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: cDNA 

30 { ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 61.. 2742 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCGACCCAC GCGTCCGGAG GAATCGTTCT GCAAATCCAG GTGTACACCT CTGAAGAAAG 
60 

ATG ATG TGT CAG AAA TTC TAT GTG GTT TTG TTA CAC TGG GAA TTT CTT 
5 108 

Met Met Cye Gin Lys Phe Tyr Val Val Leu Leu Hi 3 Trp Glu Phe Leu 
15 10 15 

TAT GTG ATA GCT GCA CTT AAC CTG GCA TAT CCA ATC TCT CCC TGG AAA 
156 

.0 Tyr Val lie Ala Ala Leu Asn Leu Ala Tyr Pro He Ser Pro Trp Lys 

20 25 30 

TTT AAG TTG TTT TGT GGA CCA CCG AAC ACA ACC GAT GAC TCC TTT CTC 
204 

Phe Lys Leu Phe Cys Gly Pro Pro Aan Thr Thr Asp Asp Ser Phe Leu 
5 35 40 45 

TCA CCT GCT GGA GCC CCA AAC AAT GCC TCG GCT TTG AAG GGG GCT TCT 
252 

Ser Pro Ala Gly Ala Pro Aen Asn Ala Ser Ala Leu Lys Gly Ala Ser 
50 55 60 

0 GAA GCA ATT GTT GAA GCT AAA TTT AAT TCA AGT GGT ATC TAC GTT CCT 
300 

Glu Ala He val Glu Ala Lys Phe Asn Ser Ser Gly He Tyr Val Pro 
65 70 75 80 

GAG TTA TCC AAA ACA GTC TTC CAC TGT TGC TTT GGG AAT GAG CAA GGT 
5 348 

Glu Leu Ser Lye Thr Val Phe His Cys Cys Phe Gly Asn Glu Gin Gly 
85 90 95 

CAA AAC TGC TCT GCA CTC ACA GAC AAC ACT GAA GGG AAG ACA CTG GCT 
396 

0 Gin Asn Cys Ser Ala Leu Thr Asp Asn Thr Glu Gly Lys Thr Leu Ala 
100 105 no 

TCA GTA GTG AAG GCT TCA GTT TTT CGC CAG CTA GGT GTA AAC TGG GAC 
444 
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Ser Val Val Lys Ala Ser Val Phe Arg Gin Leu Gly Val Aan Trp Aap 
115 120 12S 

ATA GAG TGC TGG ATG AAA GGG GAC TTG ACA TTA TTC ATC TGT CAT ATG 
492 

5 He Glu Cys Trp Met Lys Gly Aap Leu Thr Leu Phe He Cys His Met 
13° 135 140 

GAG CCA TTA CCT AAG AAC CCC TTC AAG AAT TAT GAC TCT AAG GTC CAT 
540 

Glu Pro Leu Pro Lys Asn Pro Phe Lys Asn Tyr Asp Ser Lys Val His 

10 145 150 155 160 

CTT TTA TAT GAT CTG CCT GAA GTC ATA GAT GAT TCG CCT CTG CCC CCA 
588 

Leu Leu Tyr Asp Leu Pro Glu Val He Asp Asp Ser Pro Leu Pro Pro 
165 170 175 

15 CTG AAA GAC AGC TTT CAG ACT GTC CAA TGC AAC TGC AGT CTT CGG GGA 
636 

Leu Lys Asp Ser Phe Gin Thr Val Gin Cys Asn Cys Ser Leu Arg Gly 
ISO 185 190 

TGT GAA TGT CAT GTG CCG GTA CCC AG A GCC AAA CTC AAC TAC GCT CTT 
20 684 

Cys Glu Cys His Val Pro Val Pro Arg Ala Lys Leu Asn Tyr Ala Leu 
195 200 205 

CTG ATG TAT TTG GAA ATC ACA TCT GCC GGT GTG AGT TTT CAG TCA CCT 
732 

2 5 Leu Met Tyr Leu Glu He Thr Ser Ala Gly Val Ser Phe Gin Ser Pro 
210 215 220 

CTG ATG TCA CTG CAG CCC ATG CTT GTT GTG AAA CCC GAT CCA CCC TTA 
780 

Leu Met Ser Leu Gin Pro Met Leu Val Val Lys Pro Asp Pro Pro Leu 
30 225 230 235 240 

GGT TTG CAT ATG GAA GTC ACA GAT GAT GGT AAT TTA AAG ATT TCT TGG 
828 

Gly Leu His Met Glu Val Thr Asp Asp Gly Asn Leu Lys He Ser Trp 
245 250 255 
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GAC AGC CAA ACA ATG GCA CCA TTT CCG CTT CAA TAT CAG GTG AAA TAT 
876 

Asp Ser Gin Thr Met Ala Pro Phe Pro Leu Gin Tyr Gin Val Lys Tyr 
260 265 270 

5 TTA GAG AAT TCT ACA ATT GTA AGA GAG GCT GCT GAA ATT GTC TCA GCT 
924 

Leu Glu Aon Ser Thr lie Val Arg Glu Ala Ala Glu lie Val Ser Ala 
275 280 285 

ACA TCT CTG CTG GTA GAC AGT GTG CTT CCT GGA TCT TCA TAT GAG GTC 
10 972 

Thr Ser Leu Leu Val Asp Ser Val Leu Pro Gly Ser Ser Tyr Glu Val 
290 295 300 

CAG GTG AGG AGC AAG AGA CTG GAT GGT TCA GGA GTC TGG AGT GAC TGG 
1020 

15 Gin Val Arg Ser Lys Arg Leu Asp Gly Ser Gly Val Trp Ser Asp Trp 

305 310 315 320 

AGT TCA CCT CAA GTC TTT ACC ACA CAA GAT GTT GTG TAT TTT CCA CCC 
1068 

Ser Ser Pro Gin Val Phe Thr Thr Gin Asp Val Val Tyr Phe Pro Pro 

20 325 330 335 

AAA ATT CTG ACT AGT GTT GGA TCG AAT GCT TCT TTT CAT TGC ATC TAC 
1116 

Lys lie Leu Thr Ser Val Gly Ser Asn Ala Ser Phe His Cys lie Tyr 

340 345 350 

25 AAA AAC GAA AAC CAG ATT ATC TCC TCA AAA CAG ATA GTT TGG TGG AGG 
1164 

Lys Asn Glu Aen Gin lie lie Ser Ser Lys Gin lie Val Trp Trp Arg 
355 360 365 

AAT CTA GCT GAG AAA ATC CCT GAG ATA CAG TAC AGC ATT GTG AGT GAC 
30 1212 

Asn Leu Ala Glu Lys lie Pro Glu lie Gin Tyr Ser lie Val Ser Asp 
370 375 380 

CGA GTT AGC AAA GTT ACC TTC TCC AAC CTG AAA GCC ACC AGA CCT CGA 
1260 
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Arg Val Ser Lys Val Thr Phe Ser Asn Leu Lys Ala Thr Arg Pro Arc 

385 39 0 395 

395 400 

GGG AAG TTT ACC TAT GAC GCA GTG TAG TGC TGC AAT GAG GAG GCG TGC 
1308 

5 Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cya Cys Asn Glu Gin Ala Cys 

405 «o 415 

CAT^CAC CGC TAT GCT GAA TTA TAG GTG ATC GAT GTC AAT ATC AAT ATA 

His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He Asn He 
10 420 425 430 

TCA TGT GAA ACT GAC GGG TAC TTA ACT AAA ATG ACT TGC AGA TGG TCA 
1404 

Ser Cya Glu Thr Aap Gly Tyr Leu Thr Lys Met Thr Cys Arg Trp Ser 
435 440 445 

15 CCC AGO ACA ATC CAA TCA CTA GTG GGA AGC ACT GTG CAG CTG AGG TAT 
1452 

Pro Ser Thr He Gin Ser Leu Val Gly Ser Thr Val Gin Leu Arg Tvr 
450 455 46O 

CAC AGG CGC AGC CTG TAT TGT CCT GAT ACT CCA TCT ATT CAT CCT ACG 
2 0 1500 

His Arg Arg Ser Leu Tyr Cys Pro Asp Ser Pro Ser He His Pro Thr 

465 470 475 A*r, 

480 

TCT GAG CCC AAA AAC TGC GTC TTA CAG AGA GAC GGC TTT TAT GAA TGT 
1548 

2 5 ser Glu Pro Lys Asn Cys Val Leu Gin Arg Asp Gly Phe Tyr Glu Cys 



485 49 0 



495 



GTT TTC CAG CCA ATC TTT CTA TTA TCT GGC TAT ACA ATG TGG ATC AGG 
1596 

Val Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp He Arg 



30 500 505 



510 



ATC AAC CAT TCT TTA GGT TCA CTT GAC TCG CCA CCA ACG TGT GTC CTT 
1644 

He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys Val Leu 
515 520 525 
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CCT GAC TCC GTA GTA AAA CCA CTA CCT CCA TCT AAC GTA AAA GCA GAG 
1692 

Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Asn Val Lys Ala Glu 
530 535 540 

5 ATT ACT GTA AAC ACT GGA TTA TTG AAA GTA TCT TGG GAA AAG CCA GTC 
1740 

lie Thr Val Abii Thr Gly Leu Leu Lys Val Ser Trp Glu Lys Pro Val 

545 550 555 560 

TTT CCG GAG AAT AAC CTT CAA TTC CAG ATT CGA TAT GGC TTA AGT GGA 
10 1788 

Phe Pro Glu Asn Asn Leu Gin Phe Gin lie Arg Tyr Gly Leu Ser Gly 
565 570 575 

AAA GAA ATA CAA TGG AAG ACA CAT GAG GTA TTC GAT GCA AAG TCA AAG 
1836 

15 Lys Glu lie Gin Trp Lys Thr His Glu Val Phe Asp Ala Lys Ser Lys 
580 585 590 

TCT GCC AGC CTG CTG GTG TCA GAC CTC TGT GCA GTC TAT GTG GTC CAG 
1884 

Ser Ala Ser Leu Leu Val Ser Asp Leu Cys Ala Val Tyr Val Val Gin 
20 595 600 605 

GTT CGC TGC CGG CGG TTG GAT GGA CTA GGA TAT TGG AGT AAT TGG AGC 
1932 

Val Arg Cys Arg Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn Trp Ser 
610 615 620 

25 AGT CCA GCC TAT ACG CTT GTC ATG GAT GTA AAA GTT CCT ATG AGA GGG 
1980 

Ser Pro Ala Tyr Thr Leu Val Met Asp Val Lys Val Pro Met Arg Gly 

625 630 635 640 

CCT GAA TTT TGG AGA AAA ATG GAT GGG GAC GTT ACT AAA AAG GAG AGA 
30 2028 

Pro Glu Phe Trp Arg Lys Met Asp Gly Asp Val Thr Lys Lys Glu Arg 
645 650 655 

AAT GTC ACC TTG CTT TGG AAG CCC CTG ACG AAA AAT GAC TCA CTG TGT 
2076 
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Asn Val Thr Leu 
660 

AGT GTG AGG AGG 
2124 

5 Ser Val Arg Arg 
675 

TGG TCA GAA GAT 
2172 

Trp Ser Glu Asp 
10 690 

GAA CCA GCG CAC 
2220 

Glu Pro Ala His 
705 

15 TCC CTT GTG AAT 
2268 

Ser Leu Val Asn 



AGT GCT GTG GAG 
20 2316 

Ser Ala Val Glu 
740 

ATC CTT TCC TGG 
2364 

25 lie Leu Ser Trp 
755 

GTT ATT GAA TGG 
2412 

Val lie Glu Trp 
30 770 

AGA ATT CCC TCG 
2460 

Arg He Pro Ser 
785 
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Leu Trp Lys Pro Leu Thr 
665 

TAC GTT GTG AAG CAT CGT 

Tyr Val Val Lys His Arg 
680 

GTG GGA AAT CGG ACC AAT 

Val Gly Asn Arg Thr Asn 
695 

ACT GTT ACA GTT CTG GCT 

Thr Val Thr Val Leu Ala 
710 

TTT AAC CTT ACC TTC TCA 

Phe Asn Leu Thr Phe Ser 
725 730 

TCA CTC AGT GCT TAT CCC 

Ser Leu Ser Ala Tyr Pro 
745 

ACA CTG TCA CCT GAT GAT 

Thr Leu Ser Pro Asp Asp 
760 

AAG ATC CTT AAT GAA GAT 

Lys lie Leu Asn Glu Asp 
775 

AAT GTT AAA AAG TTT TAT 

Asn Val Lys Lys Phe Tyr 
790 



Lys Asn Asp Ser Leu Cys 
670 

ACT GCC CAC AAT GGG ACG 

Thr Ala His Asn Gly Thr 
685 

CTC ACT TTC CTG TGG ACA 

Leu Thr Phe Leu Trp Thr 
700 

GTC AAT TCC CTC GGC GCT 

Val Asn Ser Leu Gly Ala 
715 720 

TGG CCC ATG AGT AAA GTG 

Trp Pro Met Ser Lys Val 
735 

CTG AGC AGC AGO TGT GTC 

Leu Ser Ser Ser Cys Val 
750 

TAT AGT CTG TTA TAT CTG 

Tyr Ser Leu Leu Tyr Leu 
765 

GAT GGA ATG AAG TGG CTT 

Asp Gly Met Lys Trp Leu 
780 

ATC CAC GAT AAT TTT ATT 

lie His Asp Asn Phe He 
795 800 
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CCC ATC GAG AAA TAT CAG TTT AGT CTT TAC CCA GTA TTT ATG GAA GGA 
2508 

Pro lie Glu Lye Tyr Gin Phe Ser Leu Tyr Pro Val Phe Met Glu Gly 
805 810 815 

5 GTT GGA AAA CCA AAG ATA ATT AAT GGT TTC ACC AAA GAT GCT ATC GAC 
2556 

Val Gly Lys Pro Lys lie lie Asn Gly Phe Thr Lys Asp Ala lie Asp 
820 825 830 

AAG CAG CAG AAT GAC GGA GGG CTG TAT GTC ATT GTA CCC ATA ATT ATT 
10 2604 

Lys Gin Gin Asn Asp Ala Gly Leu Tyr Val lie Val Pro lie lie lie 
835 840 845 

TCC TCT TGT GTC CTA CTG CTC GGA ACA CTG TTA ATT TCA CAC CAG AGA 
2652 

15 Ser Ser Cys Val Leu Leu Leu Gly Thr Leu Leu lie Ser His Gin Arg 
850 855 860 

ATG AAA AAG TTG TTT TGG GAC GAT GTT CCA AAC CCC AAG AAT TGT TCC 
2700 

Met Lys Lys Leu Phe Trp Asp Asp Val Pro Asn Pro Lys Asn Cys Ser 

20 865 870 875 880 

TGG GCA CAA GGA CTG AAT TTC CAA AAG AGA ACG GAC ACT CTT 
2742 

Trp Ala Gin Gly Leu Asn Phe Gin Lys Arg Thr Asp Thr Leu 
885 890 

25 TGAAGTCTCT CATGACCACT ACAGATGAAC CCAATCTACC AACTTCCCAA CAGTCCATAC 
2 802 

AAT ATT AG AA G ATG TTT AC A TTTTGATGGA GGGAAACAAA CCTAAACTAT GGTTTGAATG 
2862 

ACTAAGAAAT AACATTTGAT GAG CTT ATT A GAGAAGTGTA TATTTTGTGG CCACAATGTA 
30 2922 

GGTTTGATGT AGTTCAGTTT G GG AC AT ATG CTTGATTTTC AGGGCATCAA AAATTTAAAG 
2982 
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TTGATATTCA TGGACTCTGC ATTTTATTTC 
3Q42 

TTGGTGTCAG AACCTATTTG GGTACAGATC 
3Q97 



169 " 

TT AAGT CAT A AAATGATAAT GGTGTGACGG 
ACCAAAATAT GGTAGGTAAT GCCTT 



5 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 



10 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Met Cys Gin Lys Phe Tyr Val Val Leu Leu His Trp Glu Phe Leu 
15 10 15 

Tyr Val He Ala Ala Leu Asn Leu Ala Tyr Pro He Ser Pro Trp LyB 
15 20 25 30 

Phe Lys Leu Phe Cys Gly Pro Pro Asn Thr Thr Asp Asp Ser Phe Leu 
35 40 45 

Ser Pro Ala Gly Ala Pro Asn Asn Ala Ser Ala Leu Lys Gly Ala Ser 
50 55 60 

20 Glu Ala He Val Glu Ala Lys Phe Asn Ser Ser Gly He Tyr Val Pro 
65 70 75 80 

Glu Leu Ser Lys Thr Val Phe His Cys Cys Phe Gly Asn Glu Gin Gly 
85 90 95 

Gin Asn Cys Ser Ala Leu Thr Asp Asn Thr Glu Gly Lys Thr Leu Ala 
25 100 105 HO 



Ser Val Val Lys Ala Ser Val Phe Arg Gin Leu Gly Val Asn Trp Asp 
115 120 125 



WO 97/19952 



PCT/US96/19128 



- 170 - 

lie Glu CyB Trp Met Lys Gly Asp Leu Thr Leu Phe He cya His Met 
130 135 140 

Glu Pro Leu Pro Lys Asn Pro Phe Lys Asn Tyr Asp Ser Lys Val Hie 
145 ISO 155 160 

5 Leu Leu Tyr Asp Leu Pro Glu Val He Asp Asp Ser Pro Leu Pro Pro 
165 170 175 

Leu Lye Asp Ser Phe Gin Thr Val Gin CyB Asn Cys Ser Leu Arg Gly 
180 185 190 

Cys Glu Cys His Val Pro Val Pro Arg Ala Lys Leu Asn Tyr Ala Leu 
10 195 200 205 

Leu Met Tyr Leu Glu He Thr Ser Ala Gly Val Ser Phe Gin Ser Pro 
210 215 220 

Leu Met Ser Leu Gin Pro Met Leu Val Val Lys Pro Asp Pro Pro Leu 
225 230 235 240 

15 Gly Leu His Met Glu Val Thr Asp Asp Gly Asn Leu Lys He Ser Trp 

245 250 255 

Asp Ser Gin Thr Met Ala Pro Phe Pro Leu Gin Tyr Gin Val Lys Tyr 
260 265 270 

Leu Glu Asn Ser Thr He Val Arg Glu Ala Ala Glu He Val Ser Ala 
20 275 280 285 

Thr Ser Leu Leu Val Asp Ser Val Leu Pro Gly Ser Ser Tyr Glu Val 
290 295 300 

Gin Val Arg Ser Lys Arg Leu Asp Gly Ser Gly Val Trp Ser Asp Trp 
305 310 315 320 

25 Ser Ser Pro Gin Val Phe Thr Thr Gin Asp Val Val Tyr Phe Pro Pro 

325 330 335 



Lys He Leu Thr Ser Val Gly Ser Asn Ala Ser Phe His Cys He Tyr 
340 345 350 
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Lys Aan Glu Asn Gin lie lie Ser Ser Lys Gin He Val Trp Trp Arg 
355 360 365 

Asn Leu Ala Glu Lys He Pro Glu He Gin Tyr Ser He Val Ser Asp 
370 375 380 

5 Arg Val Ser Lys Val Thr Phe Ser Asn Leu Lys Ala Thr Arg Pro Arg 
385 390 395 400 

Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu Gin Ala Cys 
405 410 415 

His His Arg Tyr Ala Glu Leu Tyr Val He Asp Val Asn He Asn He 
10 420 425 430 

Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg Trp Ser 
435 440 445 

Pro Ser Thr lie Gin Ser Leu Val Gly Ser Thr Val Gin Leu Arg Tyr 
450 455 460 

15 His Arg Arg Ser Leu Tyr Cys Pro Asp Ser Pro Ser He His Pro Thr 
465 470 475 480 

Ser Glu Pro Lys Asn Cys Val Leu Gin Arg Asp Gly Phe Tyr Glu Cys 
485 490 495 

Val Phe Gin Pro lie Phe Leu Leu Ser Gly Tyr Thr Met Trp He Arg 
20 500 505 510 

He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys Val Leu 
515 520 525 

Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Asn Val Lys Ala Glu 
530 535 540 

2 5 He Thr Val Asn Thr Gly Leu Leu Lys Val Ser Trp Glu Lys Pro Val 
545 550 555 560 



Phe Pro Glu Asn Asn Leu Gin Phe Gin He Arg Tyr Gly Leu Ser Gly 
565 570 575 
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Lye Glu lie Gin Trp Lya Thr His Glu Val Phe Aop Ala Lys Ser Lys 
580 585 590 



Ser Ala Ser Leu Leu Val Ser Asp Leu Cys Ala Val Tyr Val Val Gin 
595 600 60S 

5 Val Arg Cya Arg Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn Trp Ser 
610 615 620 

Ser Pro Ala Tyr Thr Leu Val Met Asp Val Lys Val Pro Met Arg Gly 
625 630 635 640 

Pro Glu Phe Trp Arg Lys Met Asp Gly Asp Val Thr Lys Lys Glu Arg 
10 645 650 655 

Asn Val Thr Leu Leu Trp Lys Pro Leu Thr Lys Asn Asp Ser Leu Cys 
660 665 670 

Ser Val Arg Arg Tyr Val Val Lys His Arg Thr Ala His Asn Gly Thr 
675 680 685 

15 Trp Ser Glu Asp Val Gly Asn Arg Thr Asn Leu Thr Phe Leu Trp Thr 
690 695 70O 

Glu Pro Ala His Thr Val Thr Val Leu Ala Val Asn Ser Leu Gly Ala 
705 710 715 720 

Ser Leu Val Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser Lys Val 
20 725 730 735 

Ser Ala Val Glu Ser Leu Ser Ala Tyr Pro Leu Ser Ser Ser Cys Val 
740 745 750 

lie Leu Ser Trp Thr Leu Ser Pro Asp Asp Tyr Ser Leu Leu Tyr Leu 
755 760 765 

2 5 Val He Glu Trp Lys He Leu Asn Glu Asp Asp Gly Met Lys Trp Leu 
770 775 780 



Arg lie Pro Ser Asn Val Lys Lys Phe Tyr lie His Asp Asn Phe He 
785 790 795 800 
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Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro Val Phe Met Glu Gly 
80S 810 815 

Val Gly Lys Pro Lys He He Asn Gly Phe Thr Lys Asp Ala He Asp 
820 825 830 

5 Lys Gin Gin Asn Asp Ala Gly Leu Tyr Val He Val Pro He He He 
835 840 845 

Ser Ser Cys Val Leu Leu Leu Gly Thr Leu Leu He Ser His Gin Arg 
850 855 860 



Met Lys Lys Leu Phe Trp Asp Asp Val Pro Asn Pro Lys Asn Cys Ser 

880 



10 865 870 875 



Trp Ala Gin Gly Leu Asn Phe Gin Lys Arg Thr Asp Thr Leu 

885 890 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 3871 baBe pairB 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE; cDNA 

20 ( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 19 4.. 3688 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGCACGAGCC GGTCTGGCTT GGGCAGGCTG CCCGGGCCGT GGCAGGAAGC CGGAAGCAGC 
2 5 60 



CGCGGCCCCA GTTCGGGAGA CATGGCGGGC GTTAAAGCTC TCGTGG C ATT ATCCTTCAGT 
120 
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GGGG CT ATTG GACTGACTTT TCTTATGCTG GGATGTGCCT TAGAGGATTA TGGGTGTACT 

iao 

TCTCTGAAGT AAG ATG ATT TGT CAA AAA TTC TGT GTG GTT TTG TTA CAT 
229 

5 Met lie Cys Gin Lya Phe Cys Val Val Leu Leu His 

15 10 

TGG GAA TTT ATT TAT GTG ATA ACT GCG TTT AAC TTG TCA TAT CCA ATT 
277 

Trp Glu Phe lie Tyr Val lie Thr Ala Phe Asn Leu Ser Tyr Pro lie 
10 15 20 25 

ACT CCT TGG AGA TTT AAG TTG TCT TGC ATG CCA CCA AAT TCA ACC TAT 
325 

Thr Pro Trp Arg Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr 
30 35 40 

15 GAC TAG TTC CTT TTG CCT GCT GGA CTC TCA AAG AAT ACT TCA AAT TCG 
373 

Asp Tyr Phe Leu Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser 
45 50 55 60 

AAT GGA CAT TAT GAG ACA GCT GTT GAA CCT AAG TTT AAT TCA AGT GGT 
20 421 

Asn Gly His Tyr Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly 

65 70 75 

ACT CAC TTT TCT AAC TTA TCC AAA ACA ACT TTC CAC TGT TGC TTT CGG 
469 

25 Thr Hia Phe Ser Asn Leu Ser Lys Thr Thr Phe HiB Cys Cys Phe Arg 
80 85 90 

AGT GAG CAA GAT AGA AAC TGC TCC TTA TGT GCA GAC AAC ATT GAA GGA 
517 

Ser Glu Gin Aap Arg Asn Cys Ser Leu Cys Ala Asp Asn lie Glu Gly 
3 0 95 100 105 

AAG ACA TTT GTT TCA ACA GTA AAT TCT TTA GTT TTT CAA CAA ATA GAT 
565 

Lya Thr Phe Val Ser Thr Val Asn Ser Leu Val Phe Gin Gin lie Asp 
110 115 120 
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GCA AAC TGG AAC ATA CAG TGC TGG CTA AAA GGA GAC TTA AAA TTA TTC 
613 

Ala Aen Trp Aan He Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe 



125 J.3Q 13s 



140 



5 ATC TGT TAT GTG GAG TCA TTA TTT AAG AAT CTA TTC AGG AAT TAT AAC 

661 

He Cys Tyr Val Glu Ser Leu Phe Lys Asn Leu Phe Arg Aan Tyr Aen 
145 150 155 

TAT AAG GTC CAT CTT TTA TAT GTT CTG CCT GAA GTG TTA GAA GAT TCA 
10 709 

Tyr Lys Val His Leu Leu Tyr Val Leu Pro Glu Val Leu Glu Asp Ser 
160 i6s 17Q 

CCT CTG GTT CCC CAA AAA GGC AGT TTT CAG ATG GTT CAC TGC AAT TGC 
757 

15 Pro Leu Val Pro Gin Lys Gly Ser Phe Gin Met Val His Cys Asn Cys 
175 180 las 

AGT GTT CAT GAA TGT TGT GAA TGT CTT GTG CCT GTG CCA ACA GCC AAA 
805 

Ser Val His Glu Cys Cys Glu Cys Leu Val Pro Val Pro Thr Ala Lys 
20 "0 195 200 

CTC AAC GAC ACT CTC CTT ATG TGT TTG AAA ATC ACA TCT GGT GGA GTA 
853 

Leu Asn Asp Thr Leu Leu Met Cys Leu Lys He Thr Ser Gly Gly Val 
205 210 215 220 

25 ATT TTC CAG TCA CCT CTA ATG TCA GTT CAG CCC ATA AAT ATG GTG AAG 
901 

He Phe Gin Ser Pro Leu Met Ser Val Gin Pro He Asn Met Val Lye 
225 2 3 0 23s 

CCT GAT CCA CCA TTA GGT TTG CAT ATG GAA ATC ACA GAT GAT GGT AAT 
30 949 

Pro Asp Pro Pro Leu Gly Leu His Met Glu He Thr Asp Asp Gly Aen 
240 245 250 

TTA AAG ATT TCT TGG TCC AGC CCA CCA TTG GTA CCA TTT CCA CTT CAA 
997 
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o 

260 



- .v. n= _ „ s „ ser _ _ ^ 

260 265 
T« C =TG „. T „ ^ „. MT m ^ ^ ^ ^ 

» * «. v., ly . ^ ser _ s „ T „ r _ ^ ^ o ^ ^ 

280 

« AAG .XT GTC TC, m m «, „ , IA „ „ 

ao £ - v " - - «. - .« »x ,.p s „ Ile _ Pct> 

u 295 

* 300 

— «T „ GAG GTT CAO GTG AGG GG= AAG AG, CT= GAT GGC CCA 

«, *~ Tyr ol v>1 oln v<a , r5 oly ^ ^ 

310 J15 
« ««ATC TGG ACT GAC TGG ACT ACT CCT CGT GTC TTT ACC ACA CAA c. T 

oly rrp 1Z " p Trp s " Thr Pr ° A " v " ■">* ™* «» ».p 

325 330 

20 "a/" " C CCA CCT *** MT CTG » c * ggg tct aat gtt 

340 345 
CAC TCC ATC TAT AAG AAG GAA M C « ATT GTT CCC TCA 

25 Ser „. His c ye Ile Tyr Lys Glu ftsn ^ ^ 

355 360 
C-ATT GTT TCC TGG ATG AAT TTA CCT GAG AAA ATT CCT CAA ACC CAG 
~« XXe y.x Trp Trp «e t Asn Leu Ala «» ty . Ile Pro oln Ser oln 

J75 380 
^ GAT GTT GTG ACT GAT CAT GTT AGO AAA GTT ACT TTT TTC AAT CTC 

Tyr Ao P v.x Val s. r Agp „ i8 v.i s er Ly3 vax Thr Phe Phe ABn Leu 

Jo5 ion 



30 365 



390 395 
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« « ~ m „ «. « „ „ GM _ mc w 

410 

5 TGC AAT GAA CAT GAA TGC CAT CAT CGC i«*ip „ 

1477 AT 000 GAA TTA TAT GTG ATT 

~ - «. «. ». c. ... ^ 9 ^ A1 . M> ^ ^ ^ ^ 

i. ^ 5 OIC "* « ™ « » - « « « w J „ ACI • 
». P ~ .„ „. ^ „. ii)t , <p ^ ^ ^ ^ ^ 

440 

-ACT TGC AGA TGC TCA ACC AGT ACA ATC CAO TCA CTT GCG GAA AGC 

15 Met Thr Cys Arg Trp Ser Thr Ser Thr n. el c 

445 Thr Ile «n Ser Leu Ala Glu Ser 

455 

460 

ACT TTG CAA TTG AGG TAT CAT ACT »™ 

1621 " AGC CTT TAC TGT TCT GAT ATT 

Thr Leu Gin Leu ft ra tv«- m„ , 

4el ^ ^ "~ «*- Asp lae 

470 

475 

CCA TCT ATT CAT CCC ATA TCT r»r n 

TCT CCC GAT «C TAT TTG CAG AGT 

Pro Ser He His Pro Ile Ser 

485 

490 

25 GAT GGT TTT TAT r*** » — 

i?i7 

Asp Gly Phe Tyr Glu Cys Ile Phe Gin Pro lie Phe Le T 

495 1Xe Phe Leu Ser Gly 

505 

TAC ACA ATG TGG ATT Arr * w * 
30 1,65 ^ CAC TC * CTA GGT TCA CTT GAC CTC 

Tyr Thr Met Trp He Aro ti« * 
510 

A5 520 
CCA CCA ACA TGT GTC CTT CCT r*r ^ 

1813 CT GTG GTG AAG CCA CTG TCT CCA 
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Pro P ro Th r Cye Val Leu Pro Aap Ser Val Val 

525 ^-ys Pro Leu Ser Pro 

535 

540 

***** GTG AAA CCA GAA ATT ACT ATA AAC ATT CCA TTA TTG AAA ATA 
» - Ser Val Ly8 clu _ ft8n ^ ^ u 

550 S5S 
TCT TGG GAA AAG CCA GTC TTT CCA gap s „ 

1909 CA AAT CTT CAA TTC CAG ATT 

Ser Trp Glu Lys Pro Val ok. „ 

560 J. 1 " AB " AB " LeU «" «» Gin Ile 



10 

565 



570 



CGC TAT GGT TTA ACT GGA AAA GAA GTA CAA TGG AAP ^ 

1957 TGG ATG TAT GAG GTT 

^ Gly L6U Ser «Y ty- Glu Val Gin Trp Lys M et , 

575 CQ „ P Lys Met T Vr Glu Val 



580 585 



" ™ ^ ^ ~ GTC ACT CTC CCA GTT CCA GAC TTG TGT 

rrr Asp ai, Lya ser Lys val ser ^ ^ ft8p ^ 

MS> 600 

2 0 ™ 0CT °" -««:«. «<= « gt. gat «. ^ «. 

£ v., ^ A1 . val vil ^ cy> ^ L ^ ^ 

620 

™ AGT „ T w AGC AAT M oco T , c „ ^ am 

« ^ *~ ..„ ^ s « .„„ Pro „. ^ T „ r ^ ^ ^ ^ 

635 

^ 9 C " °" *~ ~ « GAA TTT TGG ASA ATA ATT AAT GQA «T 

^ V, „ a., „„ , ro ». Tcp rl . ^ ^ 

64 5 

650 

ACT 7 ATG AAA AAG GAG AAA AAT GTC ACT TTA CTT TGG AAG CCC CTC ATG 
Thr „e t Ly3 Lye oXu Lya A8 „ val ^ ^ ^ ^ 8 ^ ^ 



660 665 
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AAA AAT GAC TCA TTG TGC AGT GTT CAG AGA TAT GTG ATA AAC CAT CAT 
2245 

Lys Asn Asp Ser Leu Cys Ser Val Gin Arg Tyr Val lie Aon His His 
670 675 680 

5 ACT TCC TGC AAT GGA ACA TGG TCA GAA GAT GTG GGA AAT CAC ACG AAA 
2293 

Thr Ser Cys Asn Gly Thr Trp Ser Glu Asp Val Gly Asn His Thr Lys 

685 690 695 700 

TTC ACT TTC CTG TGG ACA GAG CAA GCA CAT ACT GTT ACG GTT CTG GCC 
10 2341 

Phe Thr Phe Leu Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala 
705 710 715 

ATC AAT TCA ATT GGT GCT TCT GTT GCA AAT TTT AAT TTA ACC TTT TCA 
2389 

15 lie Asn Ser lie Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser 
720 725 730 

TGG CCT ATG AGC AAA GTA AAT ATC GTG CAG TCA CTC AGT GCT TAT CCT 
2437 

Trp Pro Met Ser Lys Val Asn lie Val Gin Ser Leu Ser Ala Tyr Pro 
20 735 740 745 

TTA AAC AGC AGT TGT GTG ATT GTT TCC TGG ATA CTA TCA CCC AGT GAT 
2485 

Leu Asn Ser Ser Cys Val lie Val Ser Trp lie Leu Ser Pro Ser Asp 
750 755 760 

2 5 TAC AAG CTA ATG TAT TTT ATT ATT GAG TGG AAA AAT CTT AAT GAA GAT 
2533 

Tyr Lys Leu Met Tyr Phe lie lie Glu Trp Lys Asn Leu Asn Glu Asp 
765 770 775 780 

GGT GAA ATA AAA TGG CTT AGA ATC TCT TCA TCT GTT AAG AAG TAT TAT 
30 2581 

Gly Glu lie Lys Trp Leu Arg lie Ser Ser Ser Val Lys Lys Tyr Tyr 
785 790 795 

ATC CAT GAT CAT TTT ATC CCC ATT GAG AAG TAC CAG TTC AGT CTT TAC 
2629 
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lie Hie Asp His Phe 
800 

CCA ATA TTT ATG GAA 
2677 

5 Pro lie Phe Met Glu 
815 

ACT CAA GAT GAT ATT 
2725 

Thr Gin Asp Asp lie 
10 830 

ATT GTG CCA GTA ATT 
2773 

He Val Pro Val He 
845 

15 TTA ATA TCA CAC CAA 
2821 

Leu lie Ser His Gin 
865 

AAC CCC AAG AAT TGT 
20 2869 

Asn Pro Lys Asn Cys 
880 

GAA ACG TTT GAG CAT 
2917 

2 5 Glu Thr Phe Glu His 
895 

GGT CCT CTT CTT TTG 
2965 

Gly Pro Leu Leu Leu 
30 910 

GAT ACA TCA TGG AAA 
3013 

Asp Thr Ser Trp Lys 
925 



- 180 - 

lie Pro lie Glu Lys Tyr 
805 

GGA GTG GGA AAA CCA AAG 

Gly Val Gly Lys Pro Lys 
820 

GAA AAA CAC CAG AGT GAT 

Glu Lys His Gin Ser Asp 
835 

ATT TCC TCT TCC ATC TTA 

lie Ser Ser Ser lie Leu 
850 855 

AGA ATG AAA AAG CTA TTT 

Arg Met Lys Lys Leu Phe 
870 

TCC TGG GCA CAA GGA CTT 

Ser Trp Ala Gin Gly Leu 
885 

CTT TTT ATC AAG CAT ACA 

Leu Phe lie Lys His Thr 
900 

GAG CCT GAA ACA ATT TCA 

Glu Pro Glu Thr He Ser 
915 

AAT AAA GAT GAG ATG ATG 

Asn Lys Asp Glu Met Met 
930 935 



PCT/LJS96/19128 

Gin Phe Ser Leu Tyr 
810 

ATA ATT AAT AGT TTC 

He He Asn Ser Phe 
825 

GCA GGT TTA TAT GTA 

Ala Gly Leu Tyr Val 
840 

TTG CTT GGA ACA TTA 

Leu Leu Gly Thr Leu 
860 

TGG GAA GAT GTT CCG 

Trp Glu Asp Val Pro 
875 

AAT TTT CAG AAG CCA 

Asn Phe Gin Lys Pro 
890 

GCA TCA GTG ACA TGT 

Ala Ser Val Thr Cys 
905 

GAA GAT ATC AGT GTT 

Glu Asp He Ser Val 
920 

CCA ACA ACT GTG GTC 

Pro Thr Thr Val Val 
940 
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TCT CTA CTT TCA ACA ACA GAT 
3061 

Ser Leu Leu Ser Thr Thr Asp 
945 

5 GAC CAG TTC AAC AGT GTT AAC 
3109 

Asp Gin Phe Asn Ser Val Asn 
960 

ACC TAT GAG GAC GAA AGC CAG 
10 3157 

Thr Tyr Glu Asp Glu Ser Gin 
975 

CTG ATC AGC AAC TCT AAA CCA 
3205 

15 Leu lie Ser Asn Ser Lys Pro 
990 995 

ATA AAT AGT TCA GTC ACC AAG 
3253 

lie Asn Ser Ser Val Thr Lys 

20 1005 1010 

AAG GAT TCT TTC TCT AAT AGC 
3301 

Lys Asp Ser Phe Ser Asn Ser 
1025 

2 5 TTT ATA TTA TCA GAT CAG CAT 
3349 

Phe lie Leu Ser Asp Gin His 
1040 

TTC TCA GAA GGA TTG GAT GAA 
30 3397 

Phe Ser Glu Gly Leu Asp Glu 
1055 

GAA GAA AAT AAT GAT AAA AAG 
3445 



- 181 - 

CTT GAA AAG GGT TCT GTT TGT ATT AGT 

Leu Glu Lys Gly Ser Val Cys lie Ser 
950 955 

TTC TCT GAG GCT GAG GGT ACT GAG GTA 

Phe Ser Glu Ala Glu Gly Thr Glu Val 
965 970 

AGA CAA CCC TTT GTT AAA TAC GCC ACG 

Arg Gin Pro Phe Val Lys Tyr Ala Thr 
980 985 

AGT GAA ACT GGT GAA GAA CAA GGG CTT 

Ser Glu Thr Gly Glu Glu Gin Gly Leu 
1000 

TGC TTC TCT AGC AAA AAT TCT CCG TTG 

Cys Phe Ser Ser Lys Asn Ser Pro Leu 
1015 1O20 

TCA TGG GAG ATA GAG GCC CAG GCA TTT 

Ser Trp Glu lie Glu Ala Gin Ala Phe 
1030 1035 

CCC AAC ATA ATT TCA CCA CAC CTC ACA 

Pro Asn He He Ser Pro His Leu Thr 
1045 1050 

CTT TTG AAA TTG GAG GGA AAT TTC CCT 

Leu Leu Lys Leu Glu Gly Asn Phe Pro 
1060 1065 

TCT ATC TAT TAT TTA GGG GTC ACC TCA 
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Glu Glu Asn Asn Asp Lys Lys Ser lie Tyr Tyr Leu Gly Val Thr Ser 
107 ° 1075 loao 

ATC AAA AAG AGA GAG AGT GGT GTG CTT TTG ACT GAC AAG TCA AGG GTA 
3493 

5 lie Lys Lys Arg Glu Ser Gly Val Leu Leu Thr Asp Lys Ser Arg Val 
1085 1090 1095 1100 

TCG TGC CCA TTC CCA GCC CCC TGT TTA TTC ACG GAC ATC AGA GTT CTC 
3541 

Ser Cys Pro Phe Pro Ala Pro Cys Leu Phe Thr Asp He Arg Val Leu 
10 1105 mo ins 

CAG GAC AGT TGC TCA CAC TTT GTA GAA AAT AAT ATC AAC TTA GGA ACT 
3589 

Gin Asp Ser Cys Ser His Phe Val Glu Asn Asn He Asn Leu Gly Thr 
1120 H25 H30 

15 TCT AGT AAG AAG ACT TTT GCA TCT TAC ATG CCT CAA TTC CAA ACT TGT 
3637 

Ser Ser Lys Lys Thr Phe Ala Ser Tyr Met Pro Gin Phe Gin Thr Cys 
1135 H40 H45 

TCT ACT CAG ACT CAT AAG ATC ATG GAA AAC AAG ATG TGT GAC CTA ACT 
20 3685 

Ser Thr Gin Thr His Lys lie Met Glu Asn Lys Met Cys Asp Leu Thr 
1150 H55 H60 

GTG TAATTTCACT GAAGAAACCT TCAGATTTGT GTTATAATGG G TAAT AT AAA 
3738 
2 5 Val 
1165 

GTG TAAT AG A TTATAGTTGT GGGTGGGAGA GAGAAAAGAA ACCAGAGTCC AAATTTGAAA 
3798 

ATAATTGTTC CCAACTGAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
30 3858 

AAAAAAAAAA AAA 
3871 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 amino acide 

(B) TYPE: amino acid 
5 ( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met lie Cys Gin Lys Phe Cys Val Val Leu Leu His Trp Glu Phe lie 
15 10 15 

10 Tyr Val lie Thr Ala Phe Asn Leu Ser Tyr Pro He Thr Pro Trp Arg 
20 25 30 

Phe Lys Leu Ser Cys Met Pro Pro Asn Ser Thr Tyr Asp Tyr Phe Leu 
35 40 45 

Leu Pro Ala Gly Leu Ser Lys Asn Thr Ser Asn Ser Asn Gly His Tyr 
15 50 55 60 

Glu Thr Ala Val Glu Pro Lys Phe Asn Ser Ser Gly Thr His Phe Ser 
65 70 75 80 

Asn Leu Ser Lys Thr Thr Phe His Cys Cys Phe Arg Ser Glu Gin Asp 
85 90 95 

20 Arg Asn Cys Ser Leu Cys Ala Asp Asn He Glu Gly Lys Thr Phe Val 
100 105 HO 

Ser Thr Val Asn Ser Leu Val Phe Gin Gin He Asp Ala Asn Trp Asn 
115 120 125 

lie Gin Cys Trp Leu Lys Gly Asp Leu Lys Leu Phe lie Cys Tyr Val 
25 130 135 140 



Glu Ser Leu Phe Lys Asn Leu Phe Arg Asn Tyr Asn Tyr Lys Val His 
145 150 155 160 
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Leu Leu Tyr Val Leu 
165 

Gin Lys Gly Ser Phe 
180 

5 Cys Cys Glu Cys Leu 
195 

Leu Leu Met: Cys Leu 
210 

Pro Leu Met Ser Val 
10 225 

Leu Gly Leu His Met 
245 

Trp Ser Ser Pro Pro 
260 

15 Tyr Ser Glu Asn Ser 
275 

Ser Ala Thr Ser Leu 
290 

Glu Val Gin Val Arg 
20 305 

Asp Trp Ser Thr Pro 
325 

Pro Pro Lya lie Leu 
340 

25 He Tyr Lys Lys Glu 
355 

Trp Met Asn Leu Ala 
370 
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Pro Glu Val Leu Glu 
170 

Gin Met Val His Cys 
185 

Val Pro Val Pro Thr 
200 

Lya He Thr Ser Gly 
215 

Gin Pro lie Asn Met 
230 

Glu lie Thr Asp Asp 
250 

Leu Val Pro Phe Pro 
265 

Thr Thr Val He Arg 
280 

Leu Val Asp Ser lie 
295 

Gly Lys Arg Leu Asp 
310 

Arg Val Phe Thr Thr 
330 

Thr Ser Val Gly Ser 
345 

Asn Lys He Val Pro 
360 

Glu Lys He Pro Gin 
375 



Asp Ser Pro Leu Val Pro 
175 

Asn Cys Ser Val His Glu 
190 

Ala Lys Leu Asn Asp Thr 

205 

Gly Val He Phe Gin Ser 
220 

Val Lys Pro Asp Pro Pro 
235 240 

Gly Asn Leu Lys He Ser 
255 

Leu Gin Tyr Gin Val Lys 
270 

Glu Ala Aap Lys He Val 
285 

Leu Pro Gly Ser ser Tyr 
300 

Gly Pro Gly He Trp Ser 
315 320 

Gin Asp Val He Tyr Phe 
335 

Asn Val Ser Phe His Cys 
350 

Ser Lys Glu He Val Trp 
365 

Ser Gin Tyr Asp Val Val 
380 
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Ser Asp His Val Ser Lys Val Thr Phe Phe Asn Leu Asn Glu Thr Lys 
385 390 395 400 

Pro Arg Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu His 
405 410 415 

5 Glu Cys His His Arg Tyr Ala Glu Leu Tyr Val lie Asp Val Asn lie 
420 425 430 

Asn lie Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg 
435 440 445 

Trp Ser Thr Ser Thr lie Gin Ser Leu Ala Glu Ser Thr Leu Gin Leu 
10 450 455 460 

Arg Tyr His Arg Ser Ser Leu Tyr Cys Ser Asp lie Pro Ser lie His 
465 470 475 480 

Pro lie Ser Glu Pro Lys Asp Cys Tyr Leu Gin Ser Asp Gly Phe Tyr 
485 490 495 

15 Glu Cys lie Phe Gin Pro lie Phe Leu Leu Ser Gly Tyr Thr Met Trp 
500 505 510 

lie Arg lie Asn His Ser Leu Gly Ser Leu Asp Leu Pro Pro Thr CyB 
515 520 525 

Val Leu Pro Asp Ser Val Val Lys Pro Leu Ser Pro Ser Ser Val Lys 
20 530 535 540 

Ala Glu lie Thr lie Asn lie Gly Leu Leu Lys lie Ser Trp Glu Lys 
545 550 555 560 

Pro Val Phe Pro Glu Asn Asn Leu Gin Phe Gin lie Arg Tyr Gly Leu 
565 570 575 

25 Ser Gly Lys Glu Val Gin Trp Lys Met Tyr Glu Val Tyr Asp Ala Lye 
580 585 590 



Ser Lys Ser Val Ser Leu Pro Val Pro Asp Leu Cys Ala Val Tyr Ala 
595 600 605 
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Val Gin val Arg Cys Lys Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn 
610 615 620 



Trp Ser Asn Pro Ala Tyr Thr Val Val Met Asp He Lys Val Pro Met 
«5 630 635 640 

5 Arg Gly Pro Glu Phe Trp Arg He He Asn Gly Asp Thr Met Lys Lye 
645 650 655 

Glu Lys Asn Val Thr Leu Leu Trp Lys Pro Leu Met Lys Asn Asp Ser 
660 665 670 

Leu Cys Ser Val Gin Arg Tyr Val He Asn His His Thr Ser Cys Asn 
10 675 680 685 

Gly Thr Trp Ser Glu Asp Val Gly Asn HiB Thr Lys Phe Thr Phe Leu 
690 695 700 

Trp Thr Glu Gin Ala His Thr Val Thr Val Leu Ala He Asn Ser He 
70S 710 715 720 

15 Gly Ala Ser Val Ala Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser 

725 730 735 

Lys Val Asn He Val Gin Ser Leu Ser Ala Tyr Pro Leu Asn Ser Ser 
740 745 750 

Cys Val He Val Ser Trp He Leu Ser Pro Ser Asp Tyr Lys Leu Met 
20 755 760 765 

Tyr Phe He He Glu Trp Lys Asn Leu Asn Glu Asp Gly Glu He Lys 
770 775 780 

Trp Leu Arg He Ser Ser Ser Val Lys Lys Tyr Tyr He His Asp His 
785 790 795 800 

25 Phe He Pro He Glu Lys Tyr Gin Phe Ser Leu Tyr Pro He Phe Met 

805 810 815 



Glu Gly Val Gly Lya Pro Lys He He Aan Ser Phe Thr Gin Asp Asp 
820 825 830 
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lie Glu Lys His Gin Ser Asp Ala Gly Leu Tyr Val lie Val Pro Val 
835 840 845 

lie lie Ser Ser Ser lie Leu Leu Leu Gly Thr Leu Leu lie Ser His 
850 855 860 

5 Gin Arg Met Lyo Lya Leu Phe Trp Glu Asp Val Pro Asn Pro Lys Asn 
865 870 875 880 

Cys Ser Trp Ala Gin Gly Leu Asn Phe Gin Lys Pro Glu Thr Phe Glu 
885 890 895 

His Leu Phe lie Lys His Thr Ala Ser Val Thr Cys Gly Pro Leu Leu 
10 900 905 910 

Leu Glu Pro Glu Thr lie Ser Glu Asp lie Ser Val Asp Thr Ser Trp 
915 920 925 

Lys Asn Lys Asp Glu Met Met Pro Thr Thr Val Val Ser Leu Leu Ser 
930 935 940 

15 Thr Thr Asp Leu Glu Lys Gly ser Val Cys lie Ser Asp Gin Phe Asn 
945 950 955 960 



Ser Val Asn Phe Ser Glu Ala Glu Gly Thr Glu Val Thr Tyr Glu Asp 
965 970 975 

Glu Ser Gin Arg Gin Pro Phe Val Lys Tyr Ala Thr Leu lie Ser Asn 
20 980 985 990 

Ser Lys Pro Ser Glu Thr Gly Glu Glu Gin Gly Leu lie Asn Ser Ser 
995 1000 1005 

Val Thr Lys Cys Phe Ser Ser Lys Asn Ser Pro Leu Lys Asp Ser Phe 
1010 1015 1020 

2 5 Ser Asn Ser Ser Trp Glu He Glu Ala Gin Ala Phe Phe He Leu Ser 
1025 1030 1035 i 0 40 



Asp Gin His Pro Asn lie lie Ser Pro His Leu Thr Phe Ser Glu Gly 
1045 1050 1055 
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Leu Asp Glu Leu Leu Lys Leu Glu Gly Aan Phe Pro Glu Glu Asn Asn 
1060 1065 1070 

Asp Lys Lys Ser lie Tyr Tyr Leu Gly Val Thr Ser He Lys Lys Arg 
1075 1080 1085 

5 Glu Ser Gly Val Leu Leu Thr Aap Lys Ser Arg Val Ser Cya Pro Phe 
1090 1095 1100 

Pro Ala Pro Cya Leu Phe Thr Asp He Arg Val Leu Gin Asp Ser Cys 
1105 1110 1115 H20 

Ser His Phe Val Glu Asn Asn lie Asn Leu Gly Thr Ser Ser Lys Lys 
10 1125 1130 H35 

Thr Phe Ala Ser Tyr Met Pro Gin Phe Gin Thr Cys Ser Thr Gin Thr 
1140 1145 1150 

HiB Lys lie Met Glu Asn Lys Met Cys Asp Leu Thr Val 
1155 1160 1165 



15 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

2 0 (ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Tyr lie Ser Pro Glu Ser Pro Val Val Gin Leu His Ser Asn Phe Thr 
15 10 15 

Ala Val Cys Val Leu Lys Glu Lys Cys Met Asp Tyr Phe His Val Asn 
25 20 25 30 



Ala Asn Tyr lie Val Trp Lys Thr Asn His Phe Thr lie Pro Lys Glu 
35 40 45 
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Gin Tyr Thr He He Asn Arg Thr Ala Ser Ser Val Thr Phe Thr Asp 
50 55 60 



lie Ala Ser Leu Asn lie Gin Leu Thr Cys Asn lie Leu Thr Phe Gly 
65 70 75 80 

5 Gin Leu Glu Gin Asn Val Tyr Gly He Thr lie lie Ser Gly Leu Pro 

85 90 95 

Pro Glu Lys Pro Lya Asn Leu Ser Cys He Val Asn Glu Gly Lys Lys 
100 105 110 

Met Arg Cys Glu Trp Asp Gly Gly Arg Glu Thr His Leu Glu Thr Asn 
10 115 120 125 

Phe Thr Leu Lys Ser Glu Trp Ala Thr His Lys Phe Ala Asp Cys Lys 
130 135 140 

Ala Lys Arg Asp Thr Pro Thr Ser Cys Thr Val Asp Tyr Ser Thr Val 
145 150 155 160 

15 Tyr Phe Val Asn He Glu Val Trp Val Glu Ala Glu Asn Ala Leu Gly 

165 170 175 

Lys Val Thr Ser Asp His He Asn Phe Asp Pro Val Tyr Lys Val Lys 
180 185 190 

Pro Asn Pro Pro His Asn Leu Ser Val He Asn Ser Glu Glu Leu Ser 
20 195 200 205 

Ser He Leu Lys Leu Thr Trp Thr Asn Pro Ser He Lys Ser Val He 
210 215 220 

He Leu Lys Tyr Asn He Gin Tyr Arg Thr Lys Asp Ala Ser Thr Trp 
225 230 235 240 

25 Ser Gin He Pro Pro Glu ABp Thr Ala Ser Thr Arg Ser Ser Phe Thr 

245 250 255 



Val Gin Asp Leu Lys Pro Phe Thr Glu Tyr Val Phe Arg He Arg Cys 
260 265 270 
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Met Lys Glu Asp Gly Lys Gly Tyr Trp Ser Aap Trp Ser Glu Glu Ala 
275 280 285 

Ser Gly lie Thr Tyr Glu Asp Arg Pro Ser Lys Ala Pro Ser Phe Trp 
290 295 300 

5 Tyr Lya He Asp Pro Ser His Thr Gin Gly Tyr Arg Thr Val Gin Leu 

305 310 315 320 

Val Trp Lye Thr Leu Pro Pro Phe Glu Ala Asn Gly Lys He Leu Asp 
325 330 335 

Tyr Glu Val Thr Leu Thr Arg Trp Lys Ser His Leu Gin Asn Tyr Thr 
10 340 345 3so 

Val Asn Ala Thr Lys Leu Thr Val Asn Leu Thr Asn Asp Arg Tyr Leu 
355 360 365 

Ala Thr Leu Thr Val Arg Asn Leu Val Gly Lya Ser Asp Ala Ala Val 
370 375 380 

15 Leu Thr He Pro Ala Cys Asp Phe Gin Ala Thr His Pro Val Met Asp 

385 390 395 40 0 



Leu Lye Ala Phe Pro Lys Asp Asn 
405 

Pro Arg Glu Ser Val Lys Lys Tyr 
20 420 

Asp Lys Ala Pro Cys lie Thr Asp 
435 440 

His Arg Thr Tyr Leu Arg Gly Asn 
450 455 

25 He Thr Val Thr Pro Val Tyr Ala 

465 470 



Met Leu Trp Val Glu Trp Thr Thr 
410 415 

He Leu Glu Trp Cys Val Leu Ser 
425 430 

Trp Gin Gin Glu Asp Gly Thr Val 
445 

Leu Ala Glu Ser Lys Cys Tyr Leu 
460 

Asp Gly Pro Gly Ser Pro Glu Ser 
475 480 



He Lys Ala Tyr Leu Lys Gin Ala 
485 
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(2) INFORMATION FOR SEQ ID NO16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Trp Ser Xaa Trp Ser 
1 5 



10 (2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 17 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CATCTTACTT CAGAGAA 
17 



2 0 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 



CAT CTT ACTT CAGAGAAGTA CAC 
23 



(2) INFORMATION FOR SEQ ID NO: 9: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CATCTTACTT CAGAGAAGTA CACCCATAA 
29 



(2) INFORMATION FOR SEQ ID NO: 10: 

15 (i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 35 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CATCTTACTT CAGAGAAGTA CACCCATAAT CCTCT 
35 



(2) INFORMATION FOR SEQ ID NO:ll: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

AATCATCTTA CTTCAGAGAA GTACACCCAT AATCC 
35 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
10 {A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : a ingle 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTTACTTCAG AGAAGTACAC CCATAATCC 
29 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



TCAGAGAAGT ACACCCATAA TCC 
23 
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(2) INFORMATION FOR SEQ ID NO: 14: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

5 (C) ST RAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

AAGTACACCC ATAATCC 
10 17 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 
15 <C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ACAGAAUUUU UGACAAAUCA AAGCAGANNN NUCUGAGNAG UCCUUACUUC AGAGAA 
20 56 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

2 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: RNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GGCCCGGGCA GCCUGCCCAA AGCCGGNNNN CCGGAGNAGU CGCCAGACCG GCUCGUG 
57 



(2) INFORMATION FOR SEQ ID NO: 17: 

5 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

10 (ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

UGGCAUGCAA GACAAAGCAG GNNNNCCUGA GNAGUCCUUA AAUCUCCAAG GAGUAA 
56 



(2) INFORMATION FOR SEQ ID NO: 18: 

15 <i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 

2 0 (ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 

UAUAUGACAA AGCUGUNNNN AC AGAGN AG U CCUUGUGUGG UAAAGACACG 
50 



(2) INFORMATION FOR SEQ ID NO: 19: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: RNA 

5 (xi) SEQUENCE DESCRIPTION s SEQ ID NO: 19: 

AGCACCAAUU GAAUUGAUGG CCAAAGCGGG NNNNCCCGAG NAGUCAACCG U AACAG U AUG 
60 

U 
61 



10 (2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

15 (D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



UGAAAUUGUU UCAGGCUCCA AAGCCGGNNN NCCGGAGNAG UCAAGAAGAG GACCACAUGU 
60 

2 0 CACUGAUGC 
69 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 61 base pairs 
25 (B) TYPE: nucleic acid 

< C ) STRANDEDNESS : s ingle 
( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGUUUCUUCA GUGAAAUUAC ACAAAGCAGC NNNNGCUGAG NAGUCAGUUA GGUCACACAU 
60 

5 C 
61 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 53 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

15 ACCCAUUAUA ACACAAAGCU GANNNNUCAG AGNAGUCAUC UGAAGGUUUC UUC 
53 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
2 0 (B) TYPE: nucleic acid 

<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



2 5 GCTGCACTTA ACCTGGC 
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(2) INFORMATION FOR SEQ ID NO: 24: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

GGATAACTCA GGAACG 
10 16 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25 



C ACT ATTTG C CCTTCAG 
20 17 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

G CCTG AG AT A GGGGTGC 
17 



(2) INFORMATION FOR SEQ ID NO: 27: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

C ACT ATTTG C CCTTCAG 
17 



(2) INFORMATION FOR SEQ ID NO: 28: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 ine ar 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GCCTGAGATA GGGGTGC 
17 



(2) INFORMATION FOR SEQ ID NO: 29: 



25 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
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(B) TYPE: amino acid 
{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

5 Pro Asn Pro Lys Asn Cya Ser Trp 

1 5 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

15 CCAAACCCCA AGAATTGTTC CTGG 
24 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 9 amino acids 
20 (B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



Lys He Met Glu Asn Lys Met Cys A Bp 
25 1 5 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

5 <C) STRANDEDNESS: single 

< D ) TOPOLOGY : 1 ine ar 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TCRCACATYT TRTTNCCCAT TATCTT 
10 26 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
15 (D ) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Ala Gin Gly Leu Asn Phe Gin Lys 
1 5 

20 (2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
2 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
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GCACAAG6AC TGAATTTCCA AAAG 
24 



(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 20 base pair a 

(B> TYPE: nucleic acid 
( C > STRAND ED NESS : s ing le 
< D ) TOPOLOGY : 1 inear 

(ii> MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CTGCCTGAAG TGTTAGAAGA 
20 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH : 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GCTGAACTGA CATTAGAGGT G 
21 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



ACCTATGAGG ACGAAAGCCA GAG AC 
5 25 



(2) INFORMATION FOR SEQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 
10 (C> STRAND EDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



TGTGAGCAAC TGTCCTCGAG AACT 
15 24 



(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

( B ) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



GTCACGATGT CGACGTGTAC TTCTCTGAAG TAAGATGATT TG 
25 42 
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<2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

5 <C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

GTCAGGTCAG AAAAGCTTAT CACTCTGTGT TTTTCAATAT CATCTTG AG T GAA 
10 53 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

AAGCTTTTCT GACCTGACNN N 
20 21 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 3854 base pairs 
(B) TYPE: nucleic acid 
2 5 (C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 
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{ ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 61.. 3546 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

5 GTCGACCCAC GCGTCCGGAG GAATCGTTCT G C AAATCC AG GTGTACACCT CTGAAGAAAG 
60 

ATG ATG TGT CAG AAA TTC TAT GTG GTT TTG TTA CAC TGG GAA TTT CTT 
108 

Met Met Cye Gin Lye Phe Tyr Val Val Leu Leu His Trp Glu Phe Leu 
10 1 5 10 15 

TAT GTG ATA GCT GCA CTT AAC CTG GCA TAT CCA ATC TCT CCC TGG AAA 
156 

Tyr Val lie Ala Ala Leu Asn Leu Ala Tyr Pro lie Ser Pro Trp Lys 
20 25 30 

15 TTT AAG TTG TTT TGT GGA CCA CCG AAC ACA ACC GAT GAC TCC TTT CTC 
204 

Phe Lys Leu Phe Cys Gly Pro Pro Asn Thr Thr Asp Asp Ser Phe Leu 
35 40 45 

TCA CCT GCT GGA GCC CCA AAC AAT GCC TCG GCT TTG AAG GGG GCT TCT 
20 252 

Ser Pro Ala Gly Ala Pro Asn Asn Ala Ser Ala Leu Lys Gly Ala Ser 
50 55 60 

GAA GCA ATT GTT GAA GCT AAA TTT AAT TCA AGT GGT ATC TAC GTT CCT 
300 

2 5 Glu Ala He Val Glu Ala Lys Phe Asn Ser Ser Gly He Tyr Val Pro 
65 70 75 80 

GAG TTA TCC AAA ACA GTC TTC CAC TGT TGC TTT GGG AAT GAG CAA GGT 
348 

Glu Leu Ser Lys Thr Val Phe HiB Cys Cys Phe Gly Asn Glu Gin Gly 
30 85 90 95 

CAA AAC TGC TCT GCA CTC ACA GAC AAC ACT GAA GGG AAG ACA CTG GCT 
396 
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Gin Aen Cys Ser Ala 
100 

TCA GTA GTG AAG GCT 
444 

5 Ser Val Val Lys Ala 
115 

ATA GAG TGC TGG ATG 
492 

lie Glu Cys Trp Met 
10 130 

GAG CCA TTA CCT AAG 
540 

Glu Pro Leu Pro Lys 
145 

15 CTT TTA TAT GAT CTG 
588 

Leu Leu Tyr Asp Leu 
165 

CTG AAA GAC AGC TTT 
0 636 

Leu Lys Asp Ser Phe 
180 

TGT GAA TGT CAT GTG 
684 

5 Cys Glu Cys His Val 
195 

CTG ATG TAT TTG GAA 
732 

Leu Met Tyr Leu Glu 
0 210 

CTG ATG TCA CTG CAG 
780 

Leu Met Ser Leu Gin 
225 
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Leu Thr Asp Asn Thr Glu 
105 

TCA GTT TTT CGC CAG CTA 

Ser Val Phe Arg Gin Leu 
120 

AAA GGG GAC TTG ACA TTA 

Lys Gly Asp Leu Thr Leu 
135 

AAC CCC TTC AAG AAT TAT 

Asn Pro Phe Lys Asn Tyr 
150 155 

CCT GAA GTC ATA GAT GAT 

Pro Glu Val lie Asp Asp 
170 

CAG ACT GTC CAA TGC AAC 

Gin Thr Val Gin Cys Asn 
185 

CCG GTA CCC AGA GCC AAA 

Pro Val Pro Arg Ala Lys 
200 

ATC ACA TCT GCC GGT GTG 

lie Thr Ser Ala Gly Val 
215 

CCC ATG CTT GTT GTG AAA 

Pro Met Leu Val Val Lys 
230 235 
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Gly Lys Thr Leu Ala 
110 

GGT GTA AAC TGG GAC 

Gly Val Asn Trp Asp 
125 

TTC ATC TGT CAT ATG 

Phe lie Cys His Met 
140 

GAC TCT AAG GTC CAT 

Asp Ser Lys Val His 
160 

TCG CCT CTG CCC CCA 

Ser Pro Leu Pro Pro 
175 

TGC AGT CTT CGG GGA 

Cys Ser Leu Arg Gly 
190 

CTC AAC TAC GCT CTT 

Leu Asn Tyr Ala Leu 
205 

AGT TTT CAG TCA CCT 

Ser Phe Gin Ser Pro 
220 

CCC GAT CCA CCC TTA 

Pro Asp Pro Pro Leu 
240 
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GGT TTG CAT ATG GAA 
828 

Gly Leu His Met Glu 
245 

5 GAC AGC CAA ACA ATG 
876 

Asp Ser Gin Thr Met 
260 

TTA GAG AAT TCT ACA 
10 924 

Leu Glu Asn Ser Thr 
275 

ACA TCT CTG CTG GTA 
972 

15 Thr Ser Leu Leu Val 
290 

CAG GTG AGG AGC AAG 
1020 

Gin Val Arg Ser Lys 
20 305 

AGT TCA CCT CAA GTC 
1068 

Ser Ser Pro Gin Val 
325 

25 AAA ATT CTG ACT AGT 
1116 

Lys lie Leu Thr Ser 
340 

AAA AAC GAA AAC CAG 
30 1164 

Lys Asn Glu Asn Gin 
355 

AAT CTA GCT GAG AAA 
1212 
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GTC ACA GAT GAT GGT AAT 

Val Thr Asp Asp Gly Asn 
250 

GCA CCA TTT CCG CTT CAA 

Ala Pro Phe Pro Leu Gin 
265 

ATT GTA AGA GAG GCT GCT 

lie Val Arg Glu Ala Ala 
280 

GAC AGT GTG CTT CCT GGA 

Asp Ser Val Leu Pro Gly 
295 

AGA CTG GAT GGT TCA GGA 

Arg Leu Asp Gly Ser Gly 
310 315 

TTT ACC ACA CAA GAT GTT 

Phe Thr Thr Gin Asp Val 
330 

GTT GGA TCG AAT GCT TCT 

Val Gly Ser Asn Ala Ser 
345 

ATT ATC TCC TCA AAA CAG 

lie lie Ser Ser Lys Gin 
360 

ATC CCT GAG ATA CAG TAC 
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TTA AAG ATT TCT TGG 

Leu Lys lie Ser Trp 
255 

TAT CAG GTG AAA TAT 

Tyr Gin Val Lys Tyr 
270 

GAA ATT GTC TCA GCT 

Glu He Val Ser Ala 
285 

TCT TCA TAT GAG GTC 

Ser Ser Tyr Glu Val 
300 

GTC TGG AGT GAC TGG 

Val Trp Ser Asp Trp 
320 

GTG TAT TTT CCA CCC 

Val Tyr Phe Pro Pro 
335 

TTT CAT TGC ATC TAC 

Phe His Cys He Tyr 
350 

ATA GTT TGG TGG AGG 

He Val Trp Trp Arg 
365 

AGC ATT GTG AGT GAC 
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Asn Leu Ala Glu Lys lie Pro Glu lie Gin Tyr Ser lie Val Ser Asp 
370 375 380 

CGA GTT AGC AAA GTT ACC TTC TCC AAC CTG AAA GCC ACC AGA CCT CGA 
1260 

5 Arg Val Ser Lys Val Thr Phe Ser Asn Leu Lys Ala Thr Arg Pro Arg 
385 390 395 400 

GGG AAG TTT ACC TAT GAC GCA GTG TAC TGC TGC AAT GAG CAG GCG TCC 
1308 

Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cya Cys Asn Glu Gin Ala Cys 
10 405 410 415 

CAT CAC CGC TAT GCT GAA TTA TAC GTG ATC GAT GTC AAT ATC AAT ATA 
1356 

His His Arg Tyr Ala Glu Leu Tyr Val lie Aep Val Asn lie Asn lie 
420 425 430 

15 TCA TGT GAA ACT GAC GGG TAC TTA ACT AAA ATG ACT TGC AGA TGG TCA 
1404 

Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg Trp Ser 
435 440 445 

CCC AGC ACA ATC CAA TCA CTA GTG GGA AGC ACT GTG CAG CTG AGG TAT 
20 1452 

Pro Ser Thr He Gin Ser Leu Val Gly Ser Thr Val Gin Leu Arg Tyr 
450 455 460 

CAC AGG CGC AGC CTG TAT TGT CCT GAT AGT CCA TCT ATT CAT CCT ACG 
1500 

25 His Arg Arg Ser Leu Tyr Cys Pro Asp Ser Pro Ser He His Pro Thr 

465 470 475 480 

TCT GAG CCC AAA AAC TGC GTC TTA CAG AGA GAC GGC TTT TAT GAA TGT 
1548 

Ser Glu Pro Lys Asn Cys Val Leu Gin Arg Asp Gly Phe Tyr Glu Cys 
30 485 490 495 

GTT TTC CAG CCA ATC TTT CTA TTA TCT GGC TAT ACA ATG TGG ATC AGG 
1596 

Val Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp He Arg 
500 505 510 
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ATC AAC CAT TCT TTA GGT TCA CTT GAC TCG CCA CCA ACG TGT GTC CTT 
1644 

lie Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys Val Leu 
515 520 525 

5 CCT GAC TCC GTA GTA AAA CCA CTA CCT CCA TCT AAC GTA AAA GCA GAG 
1692 

Pro Asp Ser Val Val Lye Pro Leu Pro Pro Ser Asn Val Lys Ala Glu 
530 535 540 

ATT ACT GTA AAC ACT GGA TTA TTG AAA GTA TCT TGG GAA AAG CCA GTC 
10 1740 

lie Thr Val Asn Thr Gly Leu Leu Lys Val Ser Trp Glu Lys Pro Val 
545 550 555 560 

TTT CCG GAG AAT AAC CTT CAA TTC CAG ATT CGA TAT GGC TTA AGT GGA 
1788 

15 Phe Pro Glu Asn Asn Leu Gin Phe Gin lie Arg Tyr Gly Leu Ser Gly 

565 570 575 

AAA GAA ATA CAA TGG AAG ACA CAT GAG GTA TTC GAT GCA AAG TCA AAG 
1836 

Lys Glu lie Gin Trp Lys Thr His Glu Val Phe Asp Ala Lys Ser Lys 
20 580 585 590 

TCT GCC AGC CTG CTG GTG TCA GAC CTC TGT GCA GTC TAT GTG GTC CAG 
1884 

Ser Ala Ser Leu Leu Val Ser Asp Leu Cys Ala Val Tyr Val Val Gin 
595 600 60S 

25 GTT CGC TGC CGG CGG TTG GAT GGA CTA GGA TAT TGG AGT AAT TGG AGC 
1932 

Val Arg Cys Arg Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn Trp Ser 
610 615 620 

AGT CCA GCC TAT ACG CTT GTC ATG GAT GTA AAA GTT CCT ATG AGA GGG 
30 1980 

Ser Pro Ala Tyr Thr Leu Val Met Asp Val LyB Val Pro Met Arg Gly 
625 630 635 640 

CCT GAA TTT TGG AGA AAA ATG GAT GGG GAC GTT ACT AAA AAG GAG AGA 
2028 
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Pro Glu Phe Trp Arg Lys Met Asp Gly Asp Val Thr Lys Lys Glu Arg 
645 650 655 

AAT GTC ACC TTG CTT TGG AAG CCC CTG ACG AAA AAT GAC TCA CTG TGT 
2076 

5 As n Val Thr Leu Leu Trp Lys Pro Leu Thr Lys Asn Asp Ser Leu Cys 
660 665 670 

AGT GTG AGG AGG TAG GTT GTG AAG CAT CGT ACT GCC CAC AAT GGG ACG 
2124 

Ser Val Arg Arg Tyr Val Val Lys His Arg Thr Ala His Asn Gly Thr 
10 675 680 685 

TGG TCA GAA GAT GTG GGA AAT CGG ACC AAT CTC ACT TTC CTG TGG ACA 
2172 

Trp Ser Glu Asp Val Gly Aen Arg Thr Asn Leu Thr Phe Leu Trp Thr 
690 695 700 

15 GAA CCA GCG CAC ACT GTT ACA GTT CTG GCT GTC AAT TCC CTC GGC GCT 
2220 

Glu Pro Ala His Thr Val Thr Val Leu Ala Val Asn Ser Leu Gly Ala 
705 710 715 720 

TCC CTT GTG AAT TTT AAC CTT ACC TTC TCA TGG CCC ATG AGT AAA GTG 
20 2268 

Ser Leu Val Asn Phe Asn Leu Thr Phe Ser Trp Pro Met Ser Lys Val 
725 730 735 

AGT GCT GTG GAG TCA CTC AGT GCT TAT CCC CTG AGC AGC AGC TGT GTC 
2316 

25 Ser Ala Val Glu Ser Leu Ser Ala Tyr Pro Leu Ser Ser Ser Cys Val 
740 745 750 

ATC CTT TCC TGG ACA CTG TCA CCT GAT GAT TAT AGT CTG TTA TAT CTG 
2364 

lie Leu Ser Trp Thr Leu Ser Pro Asp Asp Tyr Ser Leu Leu Tyr Leu 

30 755 760 765 

GTT ATT GAA TGG AAG ATC CTT AAT GAA GAT GAT GGA ATG AAG TGG CTT 
2412 

Val lie Glu Trp Lys He Leu Asn Glu Asp Asp Gly Met Lys Trp Leu 
770 775 780 
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AGA ATT CCC TCG AAT 
2460 

Arg lie Pro Ser Asn 
785 

5 CCC ATC GAG AAA TAT 
250S 

Pro lie Glu Lys Tyr 
805 

GTT GGA AAA CCA AAG 
10 2556 

Val Gly Lys Pro Lys 
820 

AAG CAG CAG AAT GAC 
2604 

15 Lys Gin Gin Asn Asp 
835 

TCC TCT TGT GTC CTA 
2652 

Ser Ser Cys Val Leu 
20 850 

ATG AAA AAG TTG TTT 
2700 

Met Lys Lys Leu Phe 
865 

2 5 TGG GCA CAA GGA CTG 
2748 

Trp Ala Gin Gly Leu 
885 

TTT ACC AAG CAT GCA 
30 2796 

Phe Thr Lys His Ala 
900 

CCT GAA CCC ATT TCA 
2844 
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GTT AAA AAG TTT TAT ATC 

Val Lys Lys Phe Tyr lie 
790 795 

CAG TTT AGT CTT TAC CCA 

Gin Phe Ser Leu Tyr Pro 
810 

ATA ATT AAT GGT TTC ACC 

lie lie Asn Gly Phe Thr 
825 

GCA GGG CTG TAT GTC ATT 

Ala Gly Leu Tyr Val lie 
840 

CTG CTC GGA ACA CTG TTA 

Leu Leu Gly Thr Leu Leu 
855 

TGG GAC GAT GTT CCA AAC 

Trp Asp Asp val Pro ABn 
870 875 

AAT TTC CAA AAG CCT GAA 

Asn Phe Gin Lys Pro Glu 
890 

GAA TCA GTG ATA TTT GGT 

Glu Ser Val He Phe Gly 
905 

GAA GAA ATC AGT GTC GAT 
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CAC GAT AAT TTT ATT 

His Asp Asn Phe lie 
800 

GTA TTT ATG GAA GGA 

Val Phe Met Glu Gly 
815 

AAA GAT GCT ATC GAC 

Lys Asp Ala lie Asp 
830 

GTA CCC ATA ATT ATT 

Val Pro He He He 
845 

ATT TCA CAC CAG AGA 

He Ser His Gin Arg 
860 

CCC AAG AAT TGT TCC 

Pro Lys Asn Cys Ser 
880 

ACA TTT GAG CAT CTT 

Thr Phe Glu His Leu 
895 

CCT CTT CTT CTG GAG 

Pro Leu Leu Leu Glu 
910 

ACA GCT TGG AAA AAT 
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Pro Glu Pro lie Ser Glu Glu He Ser Val Aap Thr Ala Trp LyB Aan 
915 920 925 

AAA GAT GAG ATG GTC CCA GCA GCT ATG GTC TCC CTT CTT TTG ACC ACA 
2892 

5 Lys Asp Glu Met Val Pro Ala Ala Met Val Ser Leu Leu Leu Thr Thr 
930 935 940 

CCA GAC CCT GAA AGC AGT TCT ATT TGT ATT AGT GAC CAG TGT AAC ACT 
2940 

Pro Aep Pro Glu Ser Ser Ser He Cys He Ser Asp Gin Cya Aen Ser 
10 945 950 955 960 

GCT AAC TTC TCT GGG TCT CAG AGC ACC CAG GTA ACC TGT GAG GAT GAG 
2988 

Ala Aan Phe Ser Gly Ser Gin Ser Thr Gin Val Thr Cya Glu Aap Glu 
965 970 975 

15 TGT CAG AGA CAA CCC TCA GTT AAA TAT GCA ACT CTG GTC AGC AAC GAT 
3036 

Cys Gin Arg Gly Pro Ser Val Lys Tyr Ala Thr Leu Val Ser Asn Aap 

980 985 990 

AAA CTA GTG GAA ACT GAT GAA GAG CAA GGG TTT ATC CAT AGT CCT GTC 
20 3084 

Lys Leu Val Glu Thr Aap Glu Glu Gin Gly Phe He His Ser Pro Val 
99S 1000 1005 

AGC AAC TGC ATC TCC AGT AAT CAT TCC CCA CTG AGG CAG TCT TTC TCT 
3132 

25 Ser Aan Cya He Ser Ser Aan His Ser Pro Leu Arg Gin Ser Phe Ser 
1010 1015 1020 

AGC AGC TCC TGG GAG ACA GAG GCC CAG ACA TTT TTC CTT TTA TCA GAC 
3180 

Ser Ser Ser Trp Glu Thr Glu Ala Gin Thr Phe Phe Leu Leu Ser Asp 
30 1025 1030 1035 1040 

CAG CAA CCC ACC ATG ATT TCA CCA CAA CTT TCA TTC TCG GGG TTG GAT 
3228 

Gin Gin Pro Thr Met He Ser Pro Gin Leu Ser Phe Ser Gly Leu Aap 
1Q45 1050 1055 
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GAG CTT TTG GAA CTG GAG GGA AGT TTT CCT GAA GAA AAT CAC AGG GAG 
3276 

Glu Leu Leu Glu Leu Glu Gly Ser Phe Pro Glu Glu Asn His Arg Glu 
1060 1065 1070 

5 AAG TCT GTC TGT TAT CTA GGA GTC ACC TCC GTC AAC AGA AGA GAG AGT 
3324 

Lys Ser Val Cys Tyr Leu Glu Val Thr Ser Val Asn Arg Arg Glu Ser 
1075 1080 1085 

GGT GTG CTT TTG ACT GGT GAG GCA GGA ATC CTG TGC ACA TTC CCA GCC 
10 3372 

Gly Val Seu Leu Thr Gly Glu Ala Gly lie Leu Cys Thr Phe Pro Ala 

1090 1095 1100 

CAG TGT CTG TTC ACT GAC ATC AGG ATC CTC CAG GAG AGA TGC TCA CAC 
3420 

15 Gin Cys Leu Phe Ser Asp lie Arg lie Leu Gin Glu Arg Cys Ser Hia 
1105 1110 1115 1120 

TTT GTA GAA AAT AAT TTG AGT TTA GGG ACC TCT GGT GAG AAC TTT GTA 
3468 

Phe Val Glu Asn Asn Leu Ser Leu Gly Thr Ser Gly Glu Asn Phe Val 
20 1125 1130 1135 

CCT TAC ATG CCC CAA TTT CAA ACC TGT TCC ACG CAC AGT CAC AAG ATA 
3516 

Pro Tyr Met Pro Gin Phe Gin Thr Cys Ser Thr His Ser His Lys lie 
1140 1145 1150 

25 ATG GAG AAT AAG ATG TGT GAC TTA ACT GTG 
3546 

Met. Glu Asn Lys Met Cys Asp Leu Thr Val 
1155 1160 

TAATCTCATC C AAG AAG CCT CAAGGTTCCA TTCCAGTAGA GCCTGTCATG TAT AATGTG T 
30 3606 

TCTTTTATTG TTGTGGATGT GGGAGACAAG TGTCAGAATC TAGTGTGAAA ATGATTGTTT 
3666 
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CCAAACTAAG TGTGTCTATT TTCTCTCAGT AATACANATG AAACATATGA GGAAGCCCTC 
3726 

ATTAATCTAC TAATGTAGAT GGACTCTTAC TGAATATATT CCCAAGATAC TTGGGGAAGT 
3786 

5 CTCCCTAATT CTAGCTAAAA GAANTAGAAC TACTAAACAC TGAATCTGGA AAAAAAAAAA 
3846 

AAAAAAAG 
3854 



(2) INFORMATION FOR SEQ ID NO: 43: 

10 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1162 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : u nknown 

(ii) MOLECULE TYPE: protein 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Met Cys Gin Lys Phe Tyr Val Val Leu Leu His Trp Glu Phe Leu 
15 10 15 

Tyr Val lie Ala Ala Leu Asn Leu Ala Tyr Pro lie Ser Pro Trp Lys 
20 25 30 

2 0 Phe Lys Leu Phe Cys Gly Pro Pro Asn Thr Thr Asp Asp Ser Phe Leu 
35 40 45 

Ser Pro Ala Gly Ala Pro Asn Asn Ala Ser Ala Leu Lys Gly Ala Ser 
50 55 60 

Glu Ala lie Val Glu Ala Lys Phe Asn Ser Ser Gly lie Tyr Val Pro 
25 65 70 75 80 

Glu Leu Ser Lys Thr Val Phe His Cys Cys Phe Gly Asn Glu Gin Gly 
85 90 95 



WO 97/19952 



PCT/US96/19128 



- 215 - 

Gin Asn Cys Ser Ala Leu Thr Asp Asn Thr Glu Gly Lys Thr Leu Ala 
1O0 105 no 

Ser Val val Lys Ala Ser Val Phe Arg Gin Leu Gly Val Asn Trp Asp 
115 120 125 

5 lie Glu Cys Trp Met Lys Gly Asp Leu Thr Leu Phe lie Cys His Met 
130 135 140 

Glu Pro Leu Pro Lys Asn Pro Phe Lys Asn Tyr Asp Ser Lys Val His 
145 150 155 160 

Leu Leu Tyr Asp Leu Pro Glu Val lie Asp Asp Ser Pro Leu Pro Pro 
10 165 170 175 

Leu Lys Asp Ser Phe Gin Thr Val Gin Cys Asn Cys Ser Leu Arg Gly 
180 185 190 

Cys Glu Cys His Val Pro Val Pro Arg Ala Lys Leu Asn Tyr Ala Leu 
195 200 205 

15 Leu Met Tyr Leu Glu lie Thr Ser Ala Gly Val Ser Phe Gin Ser Pro 
210 215 220 

Leu Met Ser Leu Gin Pro Met Leu Val Val Lys Pro Asp Pro Pro Leu 
225 230 235 240 

Gly Leu His Met Glu Val Thr Asp Asp Gly Asn Leu Lys lie Ser Trp 
20 245 250 255 

Asp Ser Gin Thr Met Ala Pro Phe Pro Leu Gin Tyr Gin Val Lys Tyr 
260 265 270 

Leu Glu Asn Ser Thr lie Val Arg Glu Ala Ala Glu lie Val Ser Ala 
275 280 285 

2 5 Thr Ser Leu Leu Val Asp Ser Val Leu Pro Gly Ser Ser Tyr Glu Val 
290 295 300 



Gin Val Arg Ser Lys Arg Leu Asp Gly Ser Gly Val Trp Ser Asp Trp 
305 310 315 320 
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Ser Ser Pro Gin Val Phe Thr Thr Gin Asp Val Val Tyr Phe Pro Pro 
325 330 335 

Lye lie Lou Thr Ser Val Gly Ser Asn Ala Ser Phe His Cys He Tyr 
340 345 350 

5 Lys Asn Glu Asn Gin He He Ser Ser Lys Gin lie Val Trp Trp Arg 
355 360 365 

Aon Leu Ala Glu Lys He Pro Glu He Gin Tyr Ser He Val Ser Asp 
370 375 380 

Arg Val Ser Lys Val Thr Phe Ser Asn Leu Lys Ala Thr Arg Pro Arg 
10 385 390 395 400 

Gly Lys Phe Thr Tyr Asp Ala Val Tyr Cys Cys Asn Glu Gin Ala Cys 
405 410 415 

His His Arg Tyr Ala Glu Leu Tyr Val lie Asp Val Asn He Asn He 
420 425 430 

15 Ser Cys Glu Thr Asp Gly Tyr Leu Thr Lys Met Thr Cys Arg Trp Ser 
435 440 445 

Pro Ser Thr He Gin Ser Leu Val Gly Ser Thr Val Gin Leu Arg Tyr 
450 455 460 

His Arg Arg Ser Leu Tyr Cys Pro Asp Ser Pro Ser He His Pro Thr 
20 465 470 475 480 

Ser Glu Pro Lys Asn Cys Val Leu Gin Arg Asp Gly Phe Tyr Glu Cys 
485 490 495 

Val Phe Gin Pro He Phe Leu Leu Ser Gly Tyr Thr Met Trp He Arg 
500 505 510 

2 5 He Asn His Ser Leu Gly Ser Leu Asp Ser Pro Pro Thr Cys Val Leu 
515 520 525 



Pro Asp Ser Val Val Lys Pro Leu Pro Pro Ser Asn Val Lys Ala Glu 
530 535 540 
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lie Thr Val Asn Thr Cly Leu Leu Lya Val Ser Trp Glu Lys Pro Val 

545 550 555 

555 560 

Phe Pro Glu Asn Asn Leu Gin Phe Gin lie Arg Tyr Gly Leu Ser Gly 
565 570 57S 

5 Lys Glu lie Gin Trp Lys Thr His Glu Val Phe Asp Ala Lys Ser Lys 
580 S85 590 

Ser Ala Ser Leu Leu Val Ser Asp Leu Cys Ala Val Tyr Val Val Gin 
595 600 605 

Val Arg Cys Arg Arg Leu Asp Gly Leu Gly Tyr Trp Ser Asn Trp ser 
10 «° 615 620 

Ser Pro Ala Tyr Thr Leu Val Met Asp Val Lys Val Pro Met Arg Gly 

Pro Glu Phe Trp Arg Lys Met Asp Gly Asp Val Thr Lys Lys Glu Arg 
645 650 655 



15 Asn Val Thr Leu Leu Trp Lys Pro Leu Thr Lys Asn Asp Ser 
660 665 670 



Leu Cys 



Ser val Arg Arg Tyr Val Val Lys His Arg Thr Ala His Asn Gly Thr 
675 680 6S5 



Trp Ser Glu Asp Val Gly Asn Arg Thr 
20 690 695 



Aon Leu Thr Phe Leu Trp Thr 
700 



Glu Pro Ala His Thr Val Thr Val Leu Ala 



Val Asn Ser Leu Gly Ala 



705 710 715 



Ser Leu Val Asn Phe Asn Leu Thr Phe 



Ser Trp Pro Met Ser Lys Val 



725 "° 735 

2 5 Ser Ala Val Glu Ser Leu Ser Ala Tyr Pro Leu Ser Ser Ser Cys Val 
740 745 750 

He Leu Ser Trp Thr Leu Ser Pro Asp Asp Tyr Ser Leu Leu Tyr Leu 
755 760 ?65 
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Val lie Glu Trp Lys lie Leu Asn Glu Asp Asp Gly Met Lys Trp Leu 
770 775 780 

Arg lie Pro Ser Asn Val Lys Lys Phe Tyr lie His Asp Asn Phe lie 
785 790 795 8 00 

5 Pro lie Glu Lys Tyr Gin Phe Ser Leu Tyr Pro Val Phe Met Glu Gly 
805 810 815 

Val Gly Lys Pro Lys He He Asn Gly Phe Thr Lys Asp Ala He Asp 
820 825 830 

Lys Gin Gin Asn Asp Ala Gly Leu Tyr Val He Val Pro He He He 
10 835 840 845 

Ser Ser Cys Val Leu Leu Leu Gly Thr Leu Leu He Ser His Gin Arg 
850 855 860 

Met Lys Lys Leu Phe Trp Asp Asp Val Pro Asn Pro Lye Asn Cys Ser 
865 870 875 880 

15 Trp Ala Gin Gly Leu Asn Phe Gin Lys Pro Glu Thr Phe Glu His Leu 

885 890 895 

Phe Thr Lys His Ala Glu Ser Val lie Phe Gly Pro Leu Leu Leu Glu 
900 905 910 

Pro Glu Pro lie Ser Glu Glu He Ser Val Asp Thr Ala Trp Lys Asn 
20 915 920 925 

Lys Asp Glu Met Val Pro Ala Ala Met Val Ser Leu Leu Leu Thr Thr 
930 935 940 

Pro Asp Pro Glu Ser Ser Ser He Cys lie Ser Asp Gin Cys Asn Ser 
945 950 955 960 

25 Ala Asn Phe Ser Gly Ser Gin Ser Thr Gin Val Thr Cys Glu Asp Glu 

965 970 975 



Cys Gin Arg Gly Pro Ser Val Lys Tyr Ala Thr Leu Val Ser Asn Asp 
980 985 990 
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Lys Leu Val Glu Thr Asp Glu Glu Gin Gly Phe lie Hie Ser Pro Val 
995 1000 1005 

Ser Asn Cye He Ser Ser Asn His Ser Pro Leu Arg Gin Ser Phe Ser 
1010 1015 1020 

5 Ser Ser Ser Trp Glu Thr Glu Ala Gin Thr Phe Phe Leu Leu Ser Asp 
1025 1030 1035 1040 

Gin Gin Pro Thr Met He Ser Pro Gin Leu Ser Phe Ser Gly Leu Asp 
1045 1050 1055 

Glu Leu Leu Glu Leu Glu Gly Ser Phe Pro Glu Glu Asn His Arg Glu 
10 1060 1065 1070 

Lys Ser Val Cys Tyr Leu Glu Val Thr Ser Val Asn Arg Arg Glu Ser 
1075 1080 1085 

Gly Val Seu Leu Thr Gly Glu Ala Gly He Leu Cys Thr Phe Pro Ala 
1090 1095 1100 

15 Gin Cys Leu Phe Ser Asp lie Arg lie Leu Gin Glu Arg Cys ser His 
1105 1110 1115 1120 

Phe Val Glu Asn Asn Leu Ser Leu Gly Thr Ser Gly Glu Asn Phe Val 
1125 1130 1135 

Pro Tyr Met Pro Gin Phe Gin Thr Cys Ser Thr His Ser His Lys lie 
20 1140 1145 1150 

Met Glu Asn Lys Met Cys Asp Leu Thr Val 
1155 1160 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
2 5 (A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

( D ) TOPOLOGY : unknown 
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<ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



TCRCACATYT TRTTNCCCAT TATCTT 
16 



5 (2) INFORMATION FOR SEQ ID NOx45: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 



CCCAATGTCG ACATGATGTG TCAGAAATTC TAT 
33 



15 <2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

20 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



AAAAAGGATC CGGTCATTCT GCTGCTTGTC GAT 
33 



2 5 (2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 



WO 97/19952 



PCT/US96/19128 



- 221 - 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



5 CCCAATGTCG ACATGGTGTA CTTCTCTGAA GTA 
33 



(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



15 TTTTTGGATC CCACCTGCAT CACTCTGGTG 
30 



(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 



25 TTTAACTTGT CATATCCAAT TACTCCTTGG AGATTTAAGT TGTCTTGC 
48 
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(2) INFORMATION FOR SEQ ID NO: SO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



TTTTTGGATC CCACCTGCAT CACTCTGGTG 
10 30 
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CLAIMS: 



1. An isolated nucleic acid molecule containing 
the nucleotide sequence of obR gene. 

2, An isolated nucleic acid molecule which encodes 
an Ob receptor, or a fragment thereof, having a 

5 nucleotide sequence that: 

encodes the amino acid sequence shown in FIG. 1 or 
the amino acid sequence encoded by the cDNA contained in 
cDNA clone famj5312 as deposited with the ATCC having 
accession No. 69952; or 
10 encodes the amino acid sequence shown in FIG. 6; or 

encodes the amino acid sequence shown in FIG. 3 or 
the amino acid sequence contained in cDNA clone fahj53l2d 
as deposited with the ATCC having accession No. 69963, or 
in genomic clone h-obR-p87 as deposited with the ATCC; or 
15 hybridizes under stringent conditions to the 

nucleotide sequence of (a) , (b) or (c) or to its 
complement. 



3. An isolated nucleotide sequence encoding a 
polypeptide corresponding to the extracellular, 
2 0 transmembrane or cytoplasmic domain of the Ob receptor 
protein, or a deletion mutant of the Ob receptor protein 
in which the transmembrane domain or the cytoplasmic 
domain is deleted. 



4 . An isolated nucleotide sequence encoding a 
2 5 chimeric protein comprising the polypeptide of Claim 3 
fused to a heterologous polypeptide. 



5. The isolated nucleotide sequence of Claim 4 in 
which the heterologous polypeptide is a constant region 
of an immunoglobulin. 
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6. A nucleotide vector containing the nucleotide 
sequence of Claim 1, 2, 3 , 4 or 5 . 

7. An expression vector containing the nucleotide 
sequence of Claim 1, 2, 3 , 4 or 5 in operative 

5 association with a nucleotide regulatory sequence that 
controls expression of the nucleotide sequence in a host 
cell. 

8. The expression vector of Claim 7, in which said 
regulatory sequence is selected from the group consisting 

0 of the cytomegalovirus hCMV immediate early gene, the 
early or late promoters of SV4 0 adenovirus, the lac 
system, the trp system, the TAC system, the TRC system, 
the major operator and promoter regions of phage X f the 
control regions of fd coat protein, the promoter for 

5 3-phosphoglycerate kinase, the promoters of acid 

phosphatase, and the promoters of the yeast a-mating 
factors. 

9. A genetically engineered host cell that 
contains the nucleotide sequence of Claim 1, 2, 3, 4 or 

0 5. 

10. A genetically engineered host cell that 
contains the nucleotide sequence of Claim 1, 2, 3, 4 or 5 
in operative association with a nucleotide regulatory 
sequence that controls expression of the nucleotide 

5 sequence in the host cell. 

11. The genetically engineered host cell of Claim 
10 in which the host cell is a fibroblast, a Chinese 
hamster ovary cell, a COS cell, or a VERO cell, a 
hypothalamic cell or a choroid plexus cell. 
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12. An isolated Ob receptor protein. 

13. An isolated Ob receptor protein having the 
amino acid sequence shown in FIG. 1, 3 or 6 f or the amino 
acid sequence encoded by the cDNA contained in cDNA clone 

5 famj5312 as deposited with the ATCC having accession No. 
69952, or the amino acid sequence encoded by the cDNA 
contained in cDNA clone fahj5312d as deposited with the 
ATCC having accession No. 69963, or the amino acid 
sequence encoded by the genomic clone h-obR-p87 as 

0 deposited with the ATCC- 

14 . A polypeptide having an amino acid sequence 
corresponding to the extracellular, transmembrane or 
cytoplasmic domain of Ob receptor protein, or a deletion 
mutant of the Ob receptor protein in which the 

5 transmembrane domain or the cytoplasmic domain is 
deleted . 

15. A chimeric protein comprising the polypeptide 
of Claim 14 fused to a heterologous polypeptide. 

16. The chimeric protein of Claim 15 in which the 
0 heterologous polypeptide is a constant region of an 

immunoglobulin . 

17. An antibody that immunospecif ically binds the 
Ob receptor protein of Claim 12 or 13. 

18. An antibody that immunospecif ically binds the 
5 polypeptide of Claim 14. 

19. A method for diagnosing body weight disorders 
in a mammal comprising measuring obR gene expression in a 
patient sample. 
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measured 

21. The method of Claim 19 in which expression is 
measured by detecting the obR gene product. 

5 22. A method for diagnosing body weight disorders 

in a mammal, comprising detecting an obR gene mutation 
contained in the genome of the mammal. 

23. A method for screening compounds useful for the 
treatment of body weight disorders, comprising contacting 

10 a compound with a cultured host cell that expresses the 
obR gene, and detecting a change in the expression of the 
obR gene, a change in activity of the obR gene product 
expressed by the cultured cell, or a change in tyrosine 
phosphorylation of a host cell protein, or a change in 

15 ion flux in the host cell. 



- 226 - 

The method of Claim 19 in which expression is 
by detecting mRNA transcripts of the obR gene. 



24. The method of Claim 23 in which expression of 
the obR gene is detected by measuring mRNA transcripts of 
the obR gene. 

25. The method of Claim 2 3 in which expression of 
20 the obR gene is detected by measuring Ob receptor 

protein. 

26. The method of Claim 23 in which tyrosine 
phosphorylation of host cell protein is assayed using an 
anti-phosphotyrosine antibody . 

25 27. A method for treating a low body weight 

disorder in a mammal, comprising administering a compound 
to the mammal in an amount sufficient to inhibit 
activation of the Ob receptor by endogenous Ob. 
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28. The method of Claim 27 in which the low body 
weight disorder is anorexia, cachexia, bulimia, AIDS- 
related wasting or cancer-related wasting* 

29. The method of Claim 27 in which the compound is 
5 delivered to the hypothalamus or the choroid plexus. 

30. The method of Claim 27 in which the compound is 
an antagonist that binds to the Ob receptor and inhibits 
activation of the receptor. 

31- The method of Claim 27 in which the compound 
10 binds to endogenous Ob and neutralizes Ob activity* 

32. The method of Claim 31 in which the compound is 
a polypeptide corresponding to the extracellular domain 
of the Ob receptor or a portion of the extracellular 
domain that binds Ob, a deletion mutant Ob receptor 
15 protein lacking the transmembrane or cytoplasmic domain, 
or a chimeric fusion protein comprising the extracellular 
domain of the Ob receptor, or a portion of the 
extracellular domain that binds Ob, or a transmembrane 
deletion mutant fused to a heterologous polypeptide. 

2 0 33. The method of Claim 3 2 in which the 

heterologous polypeptide of the chimeric fusion protein 
is the constant region of an immunoglobulin. 

34. The method of Claim 32 or 33 in which the 
compound is delivered to the mammal by administering a 

2 5 genetically engineered host cell that expresses and 

secretes the polypeptide or fusion protein in the mammal. 

35. The method of Claim 31 in which the compound is 
an anti-idiotypic antibody, or an Fab portion thereof, 
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that mimics the extracellular domain of the Ob receptor 
and neutralizes endogenous Ob. 

36. A method for treating a low body weight 
disorder in a mammal, comprising administering a compound 

5 to the mammal in an amount sufficient to inhibit 
expression of the Ob receptor in vivo . 

37. The method of Claim 36 in which the low body 
weight disorder is anorexia, cachexia, bulimia, AIDS- 
related wasting or cancer-related wasting. 

10 38. The method of Claim 36 in which the compound is 

delivered to the hypothalamus or the choroid plexus. 

39. The method of Claim 36 in which the compound is 
an antisense oligonucleotide that inhibits translation of 
raRNA transcripts that encode the Ob receptor. 

15 40. The method of Claim 36 in which the compound is 

a ribozyme that inhibits translation of xoRNA transcripts 
that encode the Ob receptor. 

41. The method of Claim 36 in which the compound is 
an oligonucleotide that forms a triple helix with the 

20 regulatory region of the Ob receptor gene and inhibits 
transcription. 

42. The method of Claim 36 in which the compound is 
a recombinant DNA construct that inactivates the Ob 
receptor gene or its regulatory region via targeted 

25 homologous recombination. 

43. A method for treating a low body weight 
disorder in a mammal, comprising administering a compound 
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to the mammal in an amount sufficient to inhibit signal 
transduction induced by binding of endogenous Ob to the 
Ob receptor. 

44. The method of Claim 43 in which the low body 
5 weight disorder is anorexia, cachexia, bulimia, AIDS- 

related wasting, or cancer-related wasting. 

45. The method of claim 43 in which the compound is 
delivered to the hypothalamus or the choroid plexus. 

46. The method of Claim 4 3 in which the compound 
10 inhibits the activity of an intracellular mediator of Ob 

receptor-induced signal transduction . 

47. The method of Claim 46 in which the compound 
inhibits a tyrosine kinase or a tyrosine phosphatase. 

48. The method of Claim 43, 44 or 45 in which the 
15 compound is an oligonucleotide construct encoding a 

signalling-incompetent Ob receptor controlled by a 
regulatory sequence that directs the expression of the 
signalling-incompetent receptor in target cells in the 
body. 

20 49. The method of Claim 48 in which the 

oligonucleotide construct encodes a signalling- 
incompetent deletion mutant of the Ob receptor in which 
all or a portion of the cytoplasmic domain is deleted. 

50. A method for treating obesity in a mammal, 
25 comprising administering a compound to a mammal in an 
amount sufficient to up regulate expression of a 
functional Ob receptor in the mammal. 
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51. The method according to Claim 50 in which the 
compound is delivered to the hypothalamus or the choroid 
plexus . 



52. The method of Claim 50 or 51 in which the 
5 mammal expresses a defective Ob receptor and the compound 
comprises a nucleotide construct encoding a functional Ob 
receptor controlled by a regulatory region that directs 
expression of the functional receptor in target cells in 
the mammal. 



10 53 • The method of Claim 50 or 51 in which the 

mammal expresses a mutant Ob receptor and the compound 
comprises a nucleotide construct encoding a wild-type Ob 
receptor that corrects the endogenous mutation via 
targeted homologous recombination, 

15 54 • An isolated nucleic acid molecule encoding an 

Ob receptor, said nucleic acid molecule having a 
nucleotide sequence that encodes amino acids 1 to 8 68 of 
the amino acid sequence shown in Fig. 3. 

55. An isolated nucleic acid molecule encoding an 
2 0 Ob receptor, said nucleic acid molecule having a 

nucleotide sequence that encodes amino acids l to 965 of 
the amino acid sequence shown in Fig. 3. 

56. An isolated nucleic acid molecule encoding an 
Ob receptor, said nucleic acid molecule having a 

2 5 nucleotide sequence that encodes amino acids l to 1065 of 
the amino acid sequence shown in Fig. 3. 

57 . An isolated nucleic acid molecule encoding an 
Ob receptor, said nucleic acid molecule having a 
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nucleotide sequence that encodes amino acids l to 1115 of 
the amino acid sequence shown in Fig. 3. 

58 . An isolated nucleic acid molecule encoding an 
Ob receptor which can induce IL-6RE mediated gene 
5 expression. 



59. An isolated nucleic acid molecule encoding an 
Ob receptor which can induce HRRE mediated gene 
expression, 

60. The isolated nucleic acid molecule of any of 
o claims 54-57 , said molecule encoding an amino acid 

sequence having at least 9 0% identity to the amino acid 
sequence of Fig. 3. 

61. The isolated nucleic acid molecule of claim 59, 
said molecule encoding an amino acid sequence having at 

5 least 90% identity to the amino acid sequence of amino 
acids 1 to 965 of Fig. 3. 

62 . A method for evaluating whether a test agent is 
a candidate agent for treatment of body weight disorders, 
comprising: 

0 (a) exposing eukaryotic cells expressing a gene 

encoding an ObR polypeptide to said test agent; and 

(b) measuring the expression of said gene by said 
eukaryotic cells in the presence of said test agent; 

wherein said test agent is identified as a candidate 
5 agent for treatment of body weight disorders when the 
expression of said gene in the presence of said test 
agent differs from the expression of said gene by said 
eukaryotic cells in the absence of said test agent. 
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63. The method of claim 62 further comprising 
measuring the expression of said gene by said eukaryotic 
cells in the absence of said agent. 

64 . A method for evaluating whether a test agent is 
5 a candidate agent for treatment of body weight disorders, 

comprising: 

(a) exposing eukaryotic cells expressing a gene 
encoding an ObR polypeptide to said test agent; and 

(b) measuring the binding of said test agent to said 
10 eukaryotic cells; 

wherein said agent is identified as a candidate 
agent for treatment of body weight disorders when the 
binding of said test agent to said eukaryotic cells 
differs from the binding of said test agent to control 
15 eukaryotic cells which do not express said gene, said 
control eukaryotic cells being otherwise identical to 
said eukaryotic cells expressing said gene. 

65- The method of claim 64 further comprising 
measuring the binding of said test agent to said control 
20 eukaryotic cells. 

66. A method for evaluating whether a test agent is 
a candidate agent for treatment of body weight disorders, 
comprising: 

(a) exposing an obR polypeptide to said test agent; 

2 5 and 

(b) measuring the binding of siad agent to said obR 
polypeptide ; 

wherein said test agent is identified as a candidate 
agent for treatment of body weight disorders when said 

3 0 test agent selectively binds said obR polypeptide. 
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67. The method of claim 62, 64, or 66 wherein said 
obR polypeptide comprises the cytoplasmic domain of obR. 

68. The method of claim 67 wherein said cytoplasmic 
domain comprises amino acids 861 to 1165 of FIG. 3. 

5 69. The method of claim 67 wherein said obR 

polypeptide further comprises the extracellular domain of 
obR. 

70. The method of claim 66 wherein said obR 
polypeptide is fused to a detectable label. 

10 71. The method of claim 62 or 64 wherein said 

euJcaryotic cell is a choroid plexus cell. 

72 . The method of claim 62 or 64 wherein said gene 
is a recombinant gene. 
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IPLLSOVTMWIRtNHSLCSL 520 

ATC TTT CTA TTA TCT GCC TAT ACA ATC TGC ATC ACC ATC AAC CAT TCT TTA COT TCA CTT 1560 

DSPPTCVLPOSVVKPLPPSN 540 

GAC TOC CCA CCA ACC TCT OTC CTT CCT GAC TCC CTA OTA AAA CCA CTA OCT CCA TCT AAC 1620 

VXAEITVNTGLLXVSWEKPV S60 

CTA AAA CCA CAG ATT ACT GTA AAC ACT CCA TTA TTC AAA CTA TCT TGC GAA AAC CCA CTC 1680 

FPENNLQFQIR^GLSCXEIQ.' 5B0 

TTT CCC GAG AAX AAC CTT CAA TTC CAG ATT CCA TAT CCC TTA ACT CCA AAA GAA ATA CAA 1740 

WXTHEVFDAXSKSASLLVSD 600 

TGC AAC ACA CAT CAC CTA TTC CAT CCA AAC TCA AAC TCT CCC AOC CTC CTC CTC TCA CAC 1800 

LCAVYVVQVP. CRRLDCLCVW -620 

CTC TCT CCA CTC TAT CTC CTC CAC GTT CCC TCC CCC CCC TTC CAT CCA CTA CCA TAT TCC 1960 

SWWSSPAYTLVMOVKVPKRC 640 

ACT AAT TCC ACC ACT CCC 'TAX CTT CTC ATC CAT CTA AAA CTT CCT ATC ACA CCC 1520 

P Z F W RXMOCOVTKK2RNVT L 660 

CCT GAA TTT TCC ACA AAA ATC GAT GGS GAC CTT ACT AAA AAC CAC ACA AAX CTC ACC TTC 1580 

i*JC.PX.XX2*DSLCSVRRYVVK 680 

CTT TCC AAC CCC CTC AOS AAA AAT CAC TCA CTC TCT ACT CTC ACC ACC TAC GTT GTC AAC 2040 

HRT ARWGTWSEOVGNRTNLT 700 

CAT CCT ACT GCC CAC AAT GGG ACS TCC TCA CAA CAT CTC CCA AAT CCC ACC AAT CTC ACT 2100 

f'Lwtzpaktvtvlavmslga 720 

TTC CTC TCC ACA CAA CCA CCC CAC ACT CTT ACA GTT CTC CCT OTC AAT TCC CTC GCC GCT 2160 

SLV NFUX.TFSWPMSSVSAVE ICO 

TCC CTT GTG AAT TTT AAC CTT ACC TTC TCA TCC CCC ATC ACT AAA CTC ACT CCT GTC GAG 2220 

SLS AYPLSSSCVILSWTLS P 760 

TCA CTC ACT GCT TAT CCC CTC ACC ACC ACC TCT CTC ATC CTT TCC TCC ACA CTC TCA CCT 2290 

DDY SLLYLVIEWKIX.KHDDG 780 

CAT CAT TAT ACT CTC TTA TAT CTC GTT ATT CAA TCC AXG ATC CTT AAT CAA CAT CAT CCA 2 J 40 

KKWLRZPSUVXxrriKOKFI 800 

ATC AAC TCC CTT ACA ATT CCC TCC AAT CTT AAA AAC TTT TAT ATC CAC CAT AAT TTT ATT 2400 

F I E K Y Q FSLYPVfMEGVGK P 820 

CCC ATC CAC AAA TAT CAG TTT AGT CTT TAC CCA CTA TTT ATC GAA CCA CTT CCA AAA CCA 2460 
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KI INGFTKDAIDKQONDAGL 640 

AAC ATA ATT AAT OCT TTC ACC AAA CAT CCT ATC CAC AAC CAC CAC AAT CAC CCA CCC CTC 2S20 

YVIVP I irSSCVLLt.CTt.L2 8(0 

TXT CTC ATT CTA CCC ATA ATT ATT TCC TCT TCT CTC CTA CTC CTC CCA ACA CTC TEX ATT 2500 

S H Q R M„. JC JC LFWDDVPNPKHCS 880 

TCA CAC CAC ACA ATC AAA AAC TTC TTT TCC GAC CAT CTT CCA AAC CCC AAC AAT TOT TCC 2540 

WAQCLNFQKPETFEHLFTKK 900 

TCC CCA CAA CCA CTC AAT TTC CAA AAC CCT GAA ACA TTT CAC CAT CTT TTT ACC AAC CAT 2700 

AZSVI FCPL1L EPEPISEEI 920 

CCA CAA TCA CTC ATA TTT CCT CCT CTT CTT CTC CAC CCT CAA CCC ATT TCA CAA CAA ATC 2750 

SVDTANKNXDEM VPAAHVSL 940 

AC7T OTC CAT ACA CCT TCC AAA AAT AAA CAT CAC ATC CTC CCA CCA CCT ATC CTC TCC CTT 2820 

LLTTFDPESSSICISDQCNS 9SQ 

CTT TTC ACC ACA CCA CAC CCT CAA ACC ACT TCT ATT TCT ATT ACT CAC CAC TCT AAC ACT 2880 

ANFSGSQSTQVTCEDECQRQ 99 q 

CCT AAC TTC TCT CCC TCT CAC ACC ACC CAC CTA ACC TCT CAC CAT GAG TOT CAG ACA CAA ' 2940 

P SVXVATLVSMDKLVETDZE 1000 

CCC TCA CTT AAA TAT CCA ACT CTC CTC ACC AAC CAT AAA CTA CTC CAA ACT CAT CAA GAC 3000 

QGFIKSPVSNCrSSN KSPLR 1020 

CAA CCC TTT ATC CAT ACT CCT CTC ACC AAC TCC ATC TCC ACT AAT CAT TCC CCA CTC ACC 3060 

Q S P SS SSW ETEAQTPFLLSD 1040 

CAC TCT TTC TCT ACC ACC TCC TCC CAC ACA CAC CCC CAC ACA TTT TTC CTT TTA TCA CAC 3120 

Q QPTMISPQLSFSCLDEtLE 1050 

CAC CAA CCC ACC ATC ATT TCA CCA CAA CTT TCA TTC TCC CCC TTC CAT CAC CTT TTC CAA 3180 

L EC S T PEENHREKSVCYLCV 1080 

CTC CAG CCA.JfcfiX JTX CCT GAA GAA AAT CAC ACC CAC AAC TCT CTC TCT TAT CTA CCA CTC 3240 

TSVNP. RESCV LLTCSAGILC 1100 

ACC TCC CTC AAC ACA ACA CAC ACT CCT CTC CTT TTC ACT CCT CAC CCA CCA ATC CTC TCC 3300 

Tr P A. Q CLF S CIRI LQERC SH 1120 

ACA TTC CCA CCC CAC TOT CTC TTC ACT CAC ATC ACC ATC CTC CAG GAG AGA TCC TCA CAC 3360 

FVZNMLSLGTSGENFVPYMP 1140 

TTT CTA CAA AAT AAT TTC ACT TEA CCC ACC TCT CCT C%C AAC TTT CTA CCT TAC ATC CCC 3420 

QFQTCSTHSKKIMENKMCDL 1150 

CAA TTT CAA ACC TCT TCC ACC CAC ACT CAC AAC ATA ATC CAC AAT AAC ATC TCT GAC TTA 3430 

T V • 1162 

ACT CTC TAA 3439 

lCTCA TCCAAGAAGCCTCAAG ^ i-l* 1 11A1 lUU^ ATGTGC 

CJKCACAAGTCT CA CAATCTA GTC^ IT.G ^^ACTAA S TOI I7 TCT ATl I 11 Iir^ AGTAATACANATGA 

AACATATCJU»U*GC e CTCAT^ I tXO CAAGT 

C TCCCT A ATTCTACCTAAAAATAAACCCA 
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